Preface


This book introduces methods for identifying and evaluating alternative plans and 
policies that address public sector issues and problems. This is not a book on the 
analysis of policy-making processes but rather on methods of analyzing the plans and
policies themselves. Such methods aid in the planning and analysis of systems 
that provide the services the public desires and that policymakers are responsible 
for providing and managing. The modeling methods are the common ones found 
in numerous textbooks and courses on operations research and management sci- 
ence. These methods have been used to address development and management 
opportunities and issues in many disciplines including those within agriculture, 
business, ecology, economics, engineering, health, management, military science, 
and natural resources, to mention a few. 

So why this book? First, it is intended to be an introduction to the development 
and use of various types of optimization, simulation, and related systems anal- 
ysis methods at a level that does not require the traditional prerequisite courses 
in calculus, computer programming, probability or statistics, or in a particular 
application area discipline. Yet aspects of these disciplines are indeed useful to 
effectively model various problems and issues. They will be introduced and used 
when needed. Second, the emphasis in this book is on the art of converting a
verbal description or conceptual model of a system into a mathematical one. Con- 
ceptual models may be expressed only qualitatively in words or as a node-link 
network diagram representing interacting components. Developing and solving 
mathematical models allow one to estimate and compare quantitatively the var- 
ious physical, economic, ecologic, or social impacts that may result from various 
decisions. Third, the focus in this book is on modeling designed to inform those
in the public sector who are managing public systems and dealing with any con- 
flicting opinions or conflicts over their design, operation, and/or impacts. The book 
does this through the use of various example problems, often showing how differ- 
ent methods can be used to analyze the same problem or system, demonstrating 
the advantages and limitations of each modeling approach. 

Thus, this book describes how to quantify and model various policy problems 
and obtain and evaluate alternative solutions, based on various criteria, including 
political ones. It is about performing analyses for policy, not of policy. It is aimed 
at methods for providing useful information to those responsible for making policy
decisions. Modeling systems is an art, and to become a better artist takes practice. 
This book provides an opportunity to begin developing this skill. While solving 
models can be a straightforward process, developing and applying them to inform 
public policymakers is not. Model building and implementation is an art. How best 
to do it in each case is highly dependent on the problem being addressed, the data 
and time available, and on the institutional policy-making environment. 

The contents of this book have been included in courses offered to professional 
master’s degree students in the Institute of Public Affairs, within the Brooks School 
of Public Policy, at Cornell University. I am indebted to all these students for suggesting
both content and improvements over the past years. Using some modeling jargon, 
one of my objectives while writing these chapters was to minimize errors. If you 
find any, or have any suggested improvements and modifications, I will be most 
grateful if you would let me know. 

Finally, I hope you find modeling and solving systems planning and policy 
problems as much fun as I do. Who knows, someday you may get paid for doing 
it, or you may be supervising, or being informed by, those who are. One advantage 
of having some modeling skills is that they are widely applicable and hence are 
increasingly being used to improve the performance of systems in both the private 
and public sectors. The demand for those with these skills can only increase. 


Ithaca, NY, USA Daniel P. Loucks 
October 2021

Analyzing Public Policy Decisions


ABSTRACT 


An introduction to the contents of the book, which focuses on the art of building
and using various optimization and simulation modeling methods for analyzing
public systems planning and management issues. Emphasis is on the use
of models for informing policymakers and for assisting in decision-making
processes.


1.1 Introduction 


This book introduces the art of building and using models for analyzing public 
systems planning and management issues. These deterministic and probabilistic 
optimization and simulation models provide a means of identifying possible ways 
of addressing various policy problems and evaluating them based on their physi- 
cal, economic, environmental, and social impacts. While the problems we address 
to illustrate the application of various mathematical modeling tools will likely 
differ from the ones you may have to deal with in your future jobs, they serve 
to help develop your skills in applied systems analysis. Such skills should help 
you analyze and identify solutions to both well and poorly defined public systems 
planning and management problems. Typically, such problems have many possible 
solutions and the best ones, especially given multiple goals and uncertain data, are 
not obvious. 

The purpose of the quantitative and qualitative methods for managing data dis- 
cussed in this book is to inform those responsible for decision-making. They can 
help decision-makers estimate the potential impacts of the decisions they might 
make. These methods cannot determine what decisions are best, but they may 
help in determining which are better than others. What is best will depend on 
many factors, including those not considered in any mathematical modeling exer- 
cise. Different assumptions can lead to different preferred policy decisions. These 
assumptions can range from just what is to be accomplished by a proposed decision
and how those impacted will react, to details such as what interest rate may
have to be paid on loans 20 years from now. It is the decision-maker who must
decide which goals or objectives to consider and which assumptions about how
a system functions are most reasonable. The aim of all of this ‘systems analysis
technology’ is to help us generate and communicate ‘what if?’ information to
decision-makers in ways that result in more informed decision-making.

Working in the public sector, including non-governmental organizations, can 
offer many benefits: a sense of purpose and the opportunity to serve the public and 
improve the quality of the lives of those of us living in this world. For those hav- 
ing this opportunity, it will undoubtedly involve participation in decision-making 
processes. Decision-making by public officials establishes programs and policies 
that can have a significant impact on our lives and on our environment. Govern- 
ments make decisions. That is what they are supposed to do. From local decisions 
to federal or international decisions, the impact of public sector decision-making 
on the lives of people can be significant. Many organizations in the private sector 
have been benefiting from the use of systems analysis tools for over five decades,
and the military for over eight decades. Public sector uses have been more recent, but no
less useful. 

Public sector decisions, just like those in other sectors, and indeed the decisions
we make in our own lives, are influenced by many factors. Many are made with- 
out the benefit of any mathematical modeling. But such models can contribute 
useful insights on what is technically possible and on what is economically, or 
environmentally, or ecologically, or socially preferred based on various perfor- 
mance criteria. The use of models as aids to decision-making has been growing in 
the environmental, natural resources, agriculture, energy, urban planning, and public
health decision-making areas, to mention only a few. As more people become familiar
with both the advantages and the limitations of systems analysis methods applied
by competent analysts to public sector issues, their use will continue to spread and
lead to more informed decision-making throughout the public sector.


1.1.1 Historical and Other Perspectives 


Systems modeling approaches have existed for well over half a century with
early applications in the biological and ecological disciplines (von Bertalanffy, 
1950, 1968). Development and use of systems modeling approaches during the 
Second World War advanced the fields of systems analysis, systems engineering, 
and operations research, all of which involve developing and applying optimiza- 
tion, simulation, and statistical models of multicomponent systems. Operations 
Research (OR) is the name given to a discipline that focuses on the use of mathe- 
matical modeling and statistical analysis of decisions on the deployment of the 
resources under an organization’s control. Systems analysis was developed by 
RAND Corporation in 1948. It broadened and extended OR. In 1961, the Kennedy 
Administration in the US decreed that systems modeling methods, combining OR 
with cost-benefit and cost-effectiveness analyses, should be used throughout the
government to provide a quantitative basis for broad decision-making (Enthoven, 
2021). 

At the international level, the International Institute for Applied Systems Anal- 
ysis (IIASA) has been successfully providing policy-relevant analysis tools and 
information pertaining to the management of food and forests, energy, ecosystems 
and the environment, population growth, and water resources management among 
other issues since its founding in 1972. Systems analysis tools are commonly used 
in all the UN organizations and the World Bank. Mostly at the national level, 
RAND Corporation has been doing the same, but their reach and impact have 
been at the international level as well. Since 1948 RAND has been developing 
and using systems analysis methods to meet its goal of identifying solutions to 
public policy challenges to help make communities throughout the world safer 
and more secure, healthier, and more prosperous. 

The increase in computing power following the War along with advances in 
algorithms (mathematical procedures) for solving models has made it possible to 
design and analyze increasingly larger systems, often involving thousands of vari- 
ables and equations. The availability of computers and software programs that can 
solve models allows us in the application disciplines to focus on the art of model 
development and use. Just like painters and musicians and actors, the only way to 
develop skills in the art of modeling is to practice. This book has been prepared to 
assist readers in doing that. But even readers who do not become systems analysts or
modelers will gain from this material an appreciation of the benefits and limitations
of using models to better understand and manage systems, and will be better able to
understand and work with those who do build models.

Quantitative models of systems generally rely on predefined goals and causal 
relationships. Since the late 1970s, soft systems approaches have emerged in 
response to the challenges faced by modelers in the social world (Checkland, 
1999a, 1999b). Soft systems methodology is more qualitative. It is used to gain 
insight into the decision-making and planning processes and in defining conceptual 
system diagrams before introducing the mathematics. Soft systems methodology 
begins by asking what the objectives are, which of course can change over time. 
Hard systems approaches analyze the system in search of alternatives that satisfy 
the desired objectives. These approaches can include qualitative as well as quanti- 
tative modeling approaches. Chapter 17 of this book presents one way to convert 
qualitative statements to quantitative expressions suitable for incorporating into 
models. 

To be clear, these modeling approaches are not problem-free. They are all based 
on assumptions, and they cannot distinguish between good assumptions and bad 
or incorrect ones. They synthesize but fail to innovate. They fail to suggest new 
ideas that may not have been considered when creating a conceptual model of a 
system. The solution of models cannot identify what to include in a system or 
in what detail. This is one reason why modeling is an art, not a science. Even 
when modeling physical or biological processes—the science—it is a matter of 
judgment as to what detail is needed to inform the decisions being considered.
Again, modeling is an art and different analysts can differ on what they consider 
to be the best modeling approach. As stated by George E. P. Box: 


All models are approximations. All models are wrong but some are useful. However, the 
approximate nature of the model must always be borne in mind. 


If models cannot innovate, the question is how they can help humans innovate.
One approach is to incorporate models within an interactive participatory model- 
ing framework. Using participatory systems methods that include humans in the 
modeling loop, innovation may be possible. Models can generate scenarios that 
may suggest new ideas, i.e., motivate human innovation. Such models have been 
increasingly applied in the field of natural resources (van den Belt et al., 2010; 
Voinov et al., 2018).

In practice, most systems analysts use a multitude of methods. For example, 
different optimization models may be employed to narrow down the number of 
alternative plans or policies to be examined in greater detail using simulation mod- 
els. You will be introduced to both types of modeling approaches in the chapters 
that follow. You will learn that each type of model has its strengths and limitations;
there is no single best modeling approach for all analyses and problems. This will
become evident as you are introduced to the different types of models and computer
software (e.g., in Excel) used to solve the modeling problems presented in
this book.


1.2 Modeling Policy Issues 


Modeling and model outputs can help focus policy-making debates. This does not 
imply that the decision-making processes mimic best accepted modeling proce- 
dures. A decision-making framework, where first data are collected, next policy 
objectives are defined, then alternative policies that meet these objectives are iden- 
tified, analyzed, and evaluated, perhaps using some of the methods introduced in 
this book, and finally, a choice that maximizes some combination of social welfare 
(or minimizes political risk) indicators is made, rarely works in practice. For var- 
ious reasons, this logical systematic modeling framework does not represent the 
reality of most policy-making processes (Fig. 1.1). 

Policy problems not only have an analytical dimension but also a normative 
value-based one. Public policymakers need to find practical compromise
solutions to problems or issues that are acceptable to all participants. Often
there may be no such obvious solutions. These so-called ‘wicked’ problems are 
hard to define, let alone address using models. Thus, inevitably their resolution 
is temporary, tentative, and dependent on political judgments possibly informed 
by the results of models of those aspects of the problem that can be modeled. 
This distinction between the analytical approach to the discovery of knowledge 
and policy-making does not make it impossible for analysts and policymakers to
work together to better inform the policy-making process. But it is not always 
easy. While policy decisions can certainly be made without being informed by 
any analyses of alternatives, the added value of policy informed by such analyses 
suggests they are worth performing. 

There are a variety of modeling approaches that can be useful tools for inform- 
ing policymakers. Models used to inform policy are built and solved to provide 
information that can help policymakers develop insights on which they can base, 
at least in part, their policy decisions. The usefulness of such ‘policy modeling’ 
is judged not by how accurately it reflects the real world, but by how well it is 
able to provide information that enables a policymaker to make knowledgeable 
choices among policy options—i.e., how well the modeling can help construct and 
defend arguments about the relative pros and cons of alternative policy options. A 
relatively crude model that can clearly demonstrate that alternative ‘A’ performs 
better than alternative ‘B’ under both favorable and unfavorable assumptions will 
probably lead to a better decision than a complex model that can perform only a 
detailed expected-value estimation. 
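
As a rough illustration of this point, the sketch below compares two alternatives, A and B, under a favorable and an unfavorable set of assumptions. The net-benefit formula, the parameter names, and all of the numbers are hypothetical and chosen only for illustration; they are not taken from any example in this book.

```python
# Hypothetical screening comparison of two policy alternatives.
# The net-benefit formula and every number below are illustrative assumptions.

def net_benefit(annual_benefit, annual_cost, capital_cost, years):
    """Crude undiscounted net benefit over the planning period."""
    return (annual_benefit - annual_cost) * years - capital_cost

# Each scenario is one set of assumptions about how each alternative performs.
scenarios = {
    "favorable": {
        "A": dict(annual_benefit=120.0, annual_cost=40.0, capital_cost=300.0),
        "B": dict(annual_benefit=100.0, annual_cost=45.0, capital_cost=250.0),
    },
    "unfavorable": {
        "A": dict(annual_benefit=80.0, annual_cost=50.0, capital_cost=360.0),
        "B": dict(annual_benefit=60.0, annual_cost=55.0, capital_cost=300.0),
    },
}

YEARS = 20  # assumed planning period

def compare(scenarios, years):
    """Return {scenario name: {alternative: net benefit}}."""
    return {
        name: {alt: net_benefit(years=years, **params)
               for alt, params in alts.items()}
        for name, alts in scenarios.items()
    }

results = compare(scenarios, YEARS)

# A is the robust choice only if it beats B under every scenario considered.
a_always_better = all(r["A"] > r["B"] for r in results.values())

for name, r in results.items():
    print(f"{name:12s} A = {r['A']:8.1f}   B = {r['B']:8.1f}")
print("A preferred under every scenario:", a_always_better)
```

With these assumed numbers, A has the higher net benefit in both scenarios, so the ranking does not depend on which set of assumptions turns out to be closer to the truth. The crude model says nothing reliable about the actual size of the net benefits, only about the relative ranking.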

Policy models trade off rigor for relevance. In some cases, they can be used 
for screening large numbers of alternative policy options, comparing the outcomes 
of the alternatives, and/or designing strategies considering a wide range of factors 
(e.g., technical, financial, or social), but not a lot of detail about each factor. The 
outcomes are generally intended for comparative analysis (i.e., relative rankings) of 
policy alternatives. Approximate results are often sufficient to map out the decision 
space, that is, the ranges of the various input parameter values for which each
of the various policy options would be preferred. 
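
One way to picture mapping out a decision space is sketched below for a single uncertain input parameter. The two cost functions, the parameter called demand, and all numbers are hypothetical assumptions made for illustration only.

```python
# Hypothetical mapping of a one-parameter decision space.
# The two cost functions and all numbers are illustrative assumptions.

def cost_option_1(demand):
    """High fixed cost, low cost per unit of demand served."""
    return 500.0 + 2.0 * demand

def cost_option_2(demand):
    """Low fixed cost, high cost per unit of demand served."""
    return 100.0 + 6.0 * demand

def preferred(demand):
    """Name of the lower-cost option for a given demand (ties go to option 1)."""
    if cost_option_1(demand) <= cost_option_2(demand):
        return "option 1"
    return "option 2"

def decision_space(demands):
    """Group consecutive parameter values by preferred option.

    Returns a list of (option, lowest demand, highest demand) tuples.
    """
    ranges = []
    for d in demands:
        choice = preferred(d)
        if ranges and ranges[-1][0] == choice:
            # Same option as the previous value: extend the current range.
            ranges[-1] = (choice, ranges[-1][1], d)
        else:
            # Preferred option changed: start a new range.
            ranges.append((choice, d, d))
    return ranges

for option, low, high in decision_space(range(0, 201, 10)):
    print(f"{option} preferred for demand from {low} to {high}")
```

Under these assumptions the two options cost the same at a demand of 100, so option 2 is preferred below that value and option 1 at or above it. Only the approximate location of that breakpoint matters for the comparison, which is why approximate models are often sufficient for this purpose.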


1.3 Complexity 


In today’s highly interconnected societies and economies, policymakers address- 
ing one issue must consider the impacts of their decisions not only on the issue 
being addressed but also if and how those decisions may impact other aspects of 
society over time. We are all living in a multicomponent environment and dealing 
with multicomponent systems, as illustrated in Fig. 1.2. Hence, taking a systems 
approach to managing them makes sense. A systems approach focuses on the per- 
formance of the system as a whole, not of each component separately. How one 
component of a system is designed and managed may impact the performance of 
one or more other components of that system or even of other systems. These pos- 
sibilities are worth being identified and evaluated, ideally before policy decisions 
are made. Better to prevent major problems or crises than to deal with their con- 
sequences although politicians, and indeed most of us, probably get more credit 
and fame from solving crises than from preventing them. 

Fig. 1.1 A theoretical sequential modeling process on the left looks like a water cascade. The
modeling process in practice on the right has many possible feedbacks requiring model
modifications or even having to begin parts of the process over again

Fig. 1.2 Conceptual model of an interdependent interacting multicomponent system

The more complex the issue is, the more likely some application of systems
analysis methods may be helpful when considering ways of addressing the
issue. What is a complex issue? Factors characterizing complex issues include the
following:


• existence of multiple criteria (outcomes you want any decision to achieve);
• many possible alternative decisions and the ‘best’ is not obvious;
• significant uncertainty in the outcome of any decision;
• competing viewpoints or goals among decision-makers and/or stakeholders;
• conflicting criteria (e.g., improving one outcome worsens another);
• significant (size or time frame) impacts associated with any decision; and
• decision outcomes that will impact many people and are hard to modify or adapt to
  changing criteria or conditions over time.


Certainly, there are many public policy decisions that have these characteristics.
For example, consider the fossil fuel industry’s argument that production and
pipeline transport contribute to job creation and economic benefits. Positioning
themselves as friends of working people, they counter those concerned about
potential environmental damage and global warming by arguing that they are
protecting oil and gas workers’ livelihoods. It happens often. A company proposes
some big project, and environmentalists oppose it. Or a government agency proposes
new regulations intended to reduce air pollution. Public health experts say they will
improve human health and reduce premature deaths, but unions say they will destroy
jobs. These are examples of conflicting criteria.

Decisions that have multiple conflicting criteria and many alternatives are dif- 
ficult to make. These are examples of public policy issues for which systems 
analysis methods can help identify and evaluate the consequences of alternative 
policy decisions that could be taken to address them. 
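
A minimal sketch of one such screening step follows: discarding alternatives that are worse than some other alternative on every criterion, so that attention can focus on the real tradeoffs. The alternative names, the two criteria, and the scores are hypothetical and serve only to illustrate the idea.

```python
# Hypothetical screening of alternatives with two conflicting criteria.
# Names and scores are illustrative assumptions; higher is better on both.

alternatives = {
    # name: (health benefit score, jobs retained score)
    "no action":       (0.0, 5.0),
    "moderate limits": (4.0, 4.0),
    "strict limits":   (7.0, 1.0),
    "poorly designed": (3.0, 3.0),
}

def dominates(a, b):
    """True if a is at least as good as b on every criterion and better on one."""
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better

def non_dominated(options):
    """Names of alternatives that no other alternative dominates."""
    keep = []
    for name, score in options.items():
        others = (s for n, s in options.items() if n != name)
        if not any(dominates(o, score) for o in others):
            keep.append(name)
    return sorted(keep)

print(non_dominated(alternatives))
```

With these assumed scores, only the ‘poorly designed’ alternative is dropped, because ‘moderate limits’ is better on both criteria. Choosing among the remaining three requires a value judgment about how much of one outcome to give up for the other, which is a decision for policymakers rather than for the model.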


1.4 Are You Ready? 


Decision sciences are typically taught in engineering, mathematics, and economics 
departments including business schools. Because of their wide applicability, they 
are increasingly being offered in public affairs programs as well. To reassure those 
who may not have quantitative backgrounds, you do not need a mathematics or 
engineering or economics background to learn how to use the tools presented 
in this introductory text. All that is expected and assumed is some proficiency 
in algebra. The emphasis in this book is on learning the art of developing and 
solving models that address particular public policy issues. Having these skills 
can only benefit you as an employee in a public service organization. What public 
organization does not need to analyze data, make decisions, and interact with those 
having a large diversity of backgrounds and expertise in law, engineering, the 
natural sciences, the social sciences, and in communications, to mention a few? 


1.5 Book Outline 


Chapter 2 offers some insights into public systems and how models of such sys- 
tems may help inform those responsible for their design and operation. The advice 
offered in Chap. 2 is backed up with some case studies involving the application 
of modeling and factors that contributed to their success or failure. Chapters 3 and 
4 begin the introduction to developing and solving models applied to some simple 
policy and infrastructure design problems. 

Many of the models used to address policy and infrastructure issues include 
economic functions that define benefits and costs over time. Chapter 5 reviews the 
methods used to compute present, future, and annual benefits and costs, and
the influence of inflation and taxes on how we manage our personal as well as 
public investments. Those who develop models do so in part because they assume 
they can be solved. Many of the modeling examples used in this book to illustrate 
different modeling approaches can be solved on a computer using Excel. Chapter 6 
reviews how to apply the Solver component of Excel to solve a wide variety of 
optimization models.

Chapters 7, 8, and 9 focus on constrained optimization modeling, again using
policy and infrastructure issues as example problems to model and solve. Chap- 
ters 10 and 11 introduce ways calculus can be used to analyze problems that are 
characterized by continuous non-linear functions. These chapters are written for 
those not yet familiar with calculus and how slopes (marginal values) of functions 
are derived and used to find optimal solutions. Issues such as the privatization of 
public utilities and the impacts that may follow such decisions are addressed using 
these calculus-based methods. 

Chapters 12 and 13 introduce ways of dealing with uncertainty when developing 
optimization or simulation models. They review the basics of probability and statis- 
tics and introduce stochastic processes and how such processes can be included 
within models applied to various public policy issues. Chapter 14 describes how 
reliabilities associated with relationships within systems can be considered and 
introduces methods for generating values of random variables and how they can 
be applied in simulation models. 

Simulation modeling is introduced in Chap. 15, again through its application 
to policy and infrastructure planning problems, taking advantage of the infor- 
mation presented in previous chapters. The chapter serves to reinforce many of 
the modeling and solution approaches covered throughout the book. Chapter 16 
addresses situations where multiple system performance criteria or goals exist, 
and some of them may conflict with others. In such cases, tradeoffs among the 
values of multiple performance measures can be identified using various model- 
ing and other analysis approaches reviewed in this chapter, thereby informing the 
political negotiation process as it attempts to identify the most preferred policy or 
plan. 

The book concludes with an introduction on how to include qualitative expres- 
sions of goals or constraints in optimization and simulation models. Chapter 17 
explains how qualitative expressions of economic, environmental, and social con- 
cerns can be considered along with the system conditions that can be expressed 
quantitatively. Finally, Chap. 18 briefly summarizes the role this modeling plays
in the political decision-making process where public policies and infrastructure 
plans are approved and implemented. 


1.6 Conclusion 


‘Data-driven decision-making’ and ‘evidence-based decision-making’ are popular
topics these days, especially as a counterweight to the misinformation that seems to 
influence many aspects of today’s public sector decision-making. The terms refer 
to the analyses of observed scientific data to inform decision-making processes. 
The keyword is ‘to inform’. Experienced decision analysts addressing public pol- 
icy challenges recognize that no analysis, including their own, can by itself tell 
one what the best decision is pertaining to a particular public issue. Analyses are 
always limited in what they include or consider.

Nevertheless, analyses can provide insights about potential outcomes and uncertainties
and clarify what the implications may be of any decision or action taken
regarding a particular issue or problem. Applying these tools could very well 
increase the probability of achieving agreements among stakeholders, or at least 
elucidate the causes of disagreements that may exist. As mentioned earlier, they 
may also help identify new, preferred alternatives. These tools can also be used to 
help people outside of the decision process better understand why an alternative 
policy was selected. In sum, modeling approaches can provide structure, consis- 
tency, transparency, and understanding about public sector decisions, which would 
benefit the public as well as the decision-makers. 


Exercises 


1. Why develop and use models?

2. Under what conditions is modeling potentially useful to managers (decision-makers)?

3. Develop a conceptual network representation of the interdependence among our
   water, land, energy, climate, and socio-economic systems.


Public Sector Systems


ABSTRACT 


A discussion of the nature of public systems and their management. Examples 
of public systems and the services they provide show how complicated and 
complex they can be, and the challenges analysts have in providing informa- 
tion useful to those responsible for providing and managing them. Case studies 
involving modeling to improve system performance are briefly described as are 
the lessons learned from them. 


2.1 Introduction 


Let us begin with some definitions. Each discipline has its jargon, and the deci- 
sion sciences and systems analysis disciplines are no exception. Probably the most 
common term used in this book on public policy modeling is the term ‘system’. 
For us, a system refers to a set of interdependent components that work together to 
accomplish the desired outcome. Wikipedia defines a system as a group of inter- 
acting or interrelated elements that act according to a set of rules to form a unified 
whole. A system, surrounded and influenced by its environment, is described by 
its boundaries, components, structure and purpose. 

What distinguishes systems analyses from other analysis exercises is their focus 
on the performance of the system as a whole, rather than on each of the system’s 
components separately. They address the question of how each component, say 
of an urban transportation system or a community public school system, should 
be designed and operated to provide the maximum net benefits, however measured,
derived from the system. Determining just what is included in the system,
as opposed to its environment, and how to describe that system in mathematical 
expressions, is part of the art of systems modeling, an art that this book introduces. 
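
To make these terms concrete, the sketch below represents a small system as a set of components, the links among them, and a boundary separating the system from its environment. The component names and links are hypothetical; they are not the contents of any figure in this book.

```python
# Hypothetical node-link representation of a small public system.
# Component names and links are illustrative assumptions.
from collections import deque

# Each key influences the components listed in its value.
links = {
    "funding agency": ["clinics", "schools"],
    "clinics": ["residents"],
    "schools": ["residents"],
    "residents": ["funding agency"],  # feedback, e.g., through voting or taxes
    "weather": ["clinics"],           # part of the environment
}

# Components the analyst has chosen to treat as inside the system.
system_boundary = {"funding agency", "clinics", "schools", "residents"}

def affected_by(start, links):
    """Components reachable from `start` by following influence links."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in links.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def external_inputs(links, boundary):
    """Components outside the boundary that influence something inside it."""
    return sorted(
        src for src, targets in links.items()
        if src not in boundary and any(t in boundary for t in targets)
    )

print(sorted(affected_by("clinics", links)))
print(external_inputs(links, system_boundary))
```

In this assumed network a change at the clinics eventually reaches every component inside the boundary, including the clinics themselves through the feedback link, which is why the performance of the whole is examined rather than that of each component separately. Moving a component such as weather inside or outside the boundary is a modeling judgment.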

There are many types of systems, of course, and in this case, we are primarily 
interested in those in the public sector, such as those managed by governmental 
agencies or non-governmental organizations. Figure 2.1 illustrates a public health
system, where depending on the issues being addressed, each of the components 
could be a system of interacting components itself. Most systems are systems of 
systems. It is up to the analyst to define the appropriate detail to include in each 
component of any model depending on the issues being addressed, and the data 
and time available for the study, among other factors. 

This public health system is just one sub-system of any urban system, even a
relatively small one, as shown in Fig. 2.2.

The systems referred to in Figs. 2.1 and 2.2 are obviously both complicated 
and complex. There are many possible ways of designing and managing them 
and many possible measures of performance. Furthermore, given any decision, 
the results are not always predictable. The purpose of this book is to introduce 
some tools that may help identify, analyze, and evaluate the estimated impacts 
of alternative system design or management policies that one could face working 
in public or non-governmental organizations. Such information should be helpful 
to anyone having to decide what decision to make or what course of action to 
take to address a particular issue or problem. Depending on the problem or issue 
being addressed, the possible impacts of any decision may be physical (including 
medical), economic, environmental or ecological, political, and/or social. Models 
can be developed and used to estimate any or all these impacts, as appropriate. It is 
up to those developing and using models to decide what to include in any analysis 
and what information is needed to best inform those involved in the decision- 
making processes. 


Fig. 2.1 A public health system of interacting interdependent components

Fig. 2.2 A community consisting of interacting systems that provide the educational,
environmental, public health, recreational, social, and transportation services people need and expect


2.1.1 Managing Public Systems 


Some public agencies are using systems approaches to successfully manage com- 
plicated issues (e.g., banking regulation, trade treaties, community transportation, 
and healthcare systems). Such systems may have many components and uncertain- 
ties, but it is possible to understand how each of these systems can be designed 
and managed to achieve specified goals. However, other public sector problems,
frequently referred to as wicked or messy ones, are more difficult to assess and,
therefore, more challenging to manage. Rather than having discrete components
linked together in ways that are clear, often the functioning of
components as well as their interactions with others in public systems are not 
clear. For example, it might be hard to establish whether the reduced use of plas- 
tic is a result of improved industrial packaging, changing consumer habits, or 
stricter plastic disposal controls. Policy decisions for such wicked systems can 
have unintended consequences. For example, the construction of a simple road 
overpass in Somerville, Massachusetts—which was needed from an infrastructure 
development perspective—led to a rise in childhood obesity rates due to part of 
the community being cut off from leisure and sporting facilities (Curtatone & 
Esposito, 2014). 

Systems approaches have been usefully applied in a variety of public policy 
fields. For example, 


• Childhood obesity and social policy in Australia (Allender et al., 2015;
Canty-Waldron, 2014).
• Child protection in England (Lane et al., 2016).
• Design/management of children’s services in England and Wales (Gibson &
O’Donovan, 2014).
• Health issues including obesity, tobacco use, and mental health services in
North Wales (Evans et al., 2013) and public health more generally (WHO,
2009).
• Higher education in the United Kingdom (Dunnion & O’Donovan, 2014).
• Environmental issues in Sweden (Lundberg, 2011), waste oil management in
Finland (Kapustina et al., 2014), and sustainable food consumption in Norway.
• Infrastructure planning in Australia (Pepper et al., 2016).
• Military and political affairs in the United States (de Czege, 2009).
• Energy production and ecosystem preservation in South East Asia (Thomas
et al., 2017).


In complex systems, cause and effect may only be obvious in hindsight, high- 
lighting the need for different analytical tools that together can identify and 
evaluate adaptive policies and produce a better understanding of how particular 
systems function. It is important to understand the systems being analyzed and not 
underestimate the possibility of being surprised. 

Few would disagree that the public policy world of today can be volatile, 
uncertain, complex, and ambiguous. Solutions proposed to address problems or 
opportunities are often strongly contested. Not everyone has the same goals or 
desires. Therefore, many policies developed to address problems fail because of a 
lack of sufficient political support or from unforeseen side effects or difficulties in 
communication, coordination, and monitoring. The challenge for systems analysts 
is, therefore, to generate meaningful (and useful) policy options that can adapt to 
future surprises and conditions that are today unknowable, while satisfying today’s 
goals and needs. To introduce more jargon, some call such policies robust. 

The process of solving a problem involves understanding the nature of that 
problem. Those advising policymakers have a collective responsibility to collect, 
verify, and synthesize information in pursuit of a more coherent and complete 
knowledge, say for ‘what can be done about x’. However, no amount of modeling 
and data analyses will answer political or normative questions like ‘what should be 
done about x’. That is a political decision. But again, models and data can inform 
those who make such decisions. 


Politics is more difficult than physics. 
Albert Einstein 
(Conference in Princeton, N.J. January 1946.) 


Public systems modelers will be working in a political environment and will likely 
find that more challenging than the modeling itself. Examples of public systems 
challenges can include the following: 


• Criminal justice system reform with respect to the death penalty, controlling the
use of addictive drugs, reducing gun violence, and prison terms and conditions.
• Economic issues such as distribution of resources, collection and amount of
taxes and trade tariffs, minimum wages, and sick leave policies.
• Elementary and secondary educational system issues such as funding,
setting of school capacities, school districts, class sizes, staffing, and school
food programs.
• Legal system policies with respect to sports betting, sexual harassment, affordable
housing, immigration policy, disaster response and insurance requirements,
drinking water and air quality standards, driving speed limits, gun control, data
privacy, voter registration and voting rights, political redistricting, child abuse,
and domestic violence.
• Environmental systems policies related to water and air quality and noise,
clean energy and climate change mitigation measures, and wetland and wildlife
protection.
• Health system issues such as healthcare access; use of opioids, medical
marijuana, and prescription drugs; insurance requirements; and controlling
pandemics.
• Social system issues such as welfare policies, homeless management, food programs,
police protection, workers and labor union rights, animal rights, and
social media regulations.
• Transportation system issues involving the use of motor vehicles, bikes, scooters,
and buses, pedestrian walkways, licensing, infrastructure capacity and
maintenance, and control of drones and airplanes.


These, like many public sector systems, often have design, organization, and man- 
agement issues that can be analyzed to identify and evaluate alternative ways of 
addressing them. Obviously, we can’t address each of these issues in this intro- 
ductory book but we can begin to introduce some of the tools that one might use 
to analyze such issues. The problems in this book are simpler than those listed 
above, but still interesting or complex enough to warrant and illustrate the use of 
what is called systems analysis. Systems analysis includes developing and solving 
models of systems. Solutions of models can help us determine what, where, when, 
and how much to do to accomplish some goal or objective. We will use various 
modeling approaches to identify preferred system designs and operating policies 
with respect to various objectives or goals that might be considered. 


2.2 Why Apply a Systems Approach to Public Policy? 


The increasing development and use of technology and the automation and infor- 
mation it brings into our lives are creating challenges in our workplaces as well 
as for both our education and health and welfare systems. Ensuring a high-quality, 
active life for an aging population puts pressure on developing improved ways 
of providing medical and social care. Climate change, obesity, radicalization of 
social behavior, income inequality, and poverty are all issues faced by today’s 
public policymakers. What causes these and other public policy challenges and to 
what extent? How can they be effectively addressed without generating even more
problems? 

More holistic policy approaches that can define the major factors impacting a 
policy issue, that can identify the interactions among relevant components of sys- 
tems, that can focus on the performance of the whole system rather than only on its 
separate components, and that embrace the goals of stakeholders have the poten- 
tial to substantially improve the policy-making process. Such systems approaches 
can inform policymakers on the impacts of what they might decide to do and thus 
allow them to focus on the bigger picture, i.e., on the areas where change can have 
the greatest impact and on the goals they want to achieve. 

Government agencies and those NGOs and others who serve them are increas- 
ingly using systems approaches to address problems and to identify and evaluate 
possible decisions impacting the performance of their policies and programs. Public
institutions are slowly changing from a procurement-driven policy of only using
external consultants and contractors to perform systems studies toward employing
systems analysts within their organizations who can perform analyses continually
as part of their everyday practice.

Implementing change in the public sector can often be difficult. Not everyone 
wants the same change, or even any change. Decision-makers are typically risk 
averse especially regarding the possibility of failure. In many cases, one cannot 
stop providing an existing public service, such as air traffic control, or water and 
wastewater treatment, or protection from natural hazards, as changes are made in 
providing those services. Systems approaches can help navigate such transitions
by enabling organizations to continue to provide services while changing the
design and/or operation of the entire system.

Changing a system or service often requires building new skills into organiza- 
tions to help them face and adapt to new circumstances. Systems changes impact 
people as well as infrastructure. As such, they invariably spur debates about the rel- 
ative value of policy choices and the tradeoffs among conflicting goals to be made. 
Consider the efforts of public health experts attempting to achieve higher percent- 
ages of vaccinated people. This has proved to be more difficult than expected even 
when it would seem the best decision for each individual is obvious, at least for 
those wishing to avoid sickness or death. In the case of car-sharing in Canada, 
having a flexible transportation system took precedence over other work condi- 
tion concerns. In Iceland, domestic violence had to be labeled a public health 
issue rather than a private matter to gain public support. It is not easy to trans- 
form public systems and public opinion. But again, applying systems methods to 
identify and evaluate alternatives and their benefits, costs, and possible environ- 
mental, ecological, and social impacts can help provide the information needed to 
help generate the support and understanding needed to enable change to happen 
(OECD, 2017). 

Complexity and uncertainty are common properties of public systems. The
defense and intelligence communities refer to this state as ‘VUCA’, a state of
Volatility, Uncertainty, Complexity, and Ambiguity. One can argue that VUCA
characterizes much of the public sector as a whole, even if administrations do
not understand how, where, or why. One key concern is how best to account for
uncertainty while managing greater complexity and still deliver effective services.
To a degree, the answer lies in a policy-making approach that leads to robust
systems and adaptive policies. The effectiveness of the decisions made to address a
problem or issue will depend on how completely the problem and the system it
is a part of are understood. It also requires acknowledging uncertainty as part of
everyday decision-making. Changing public policy dealing with problems stemming
from interconnectivity, cyber threats, climate change, changing demographic
profiles, and migration, to mention a few of today’s issues, is not easy. The complex
process of seeing, understanding, and deciding is fundamentally challenging
our institutions. Appropriate use of systems approaches and modeling can often
help inform those involved in this process. They can help policymakers identify
what, at a more detailed systems level, may be impacting their view of the system
at a higher level. Figure 2.3 shows, at least conceptually, how a system may
look quite different at a detailed level, say at ground level, than at a higher
level, say from 3000 feet (roughly 1000 m). Both reveal information the other does not.

Fig. 2.3 Getting into the detail may reveal entirely different perceptions of urban sub-systems
needs than at higher levels of policy-making

Public policymakers have traditionally dealt with social problems by mak- 
ing only incremental change decisions. While often perceived as being a safer 
approach in terms of political risk, such incremental changes may only shift con- 
sequences from one part of the system to another or just address symptoms while 
ignoring causes. Part of learning the art of developing and applying systems mod- 
els is in defining the system that is to be analyzed. Typically, each component of 
a system is a system in itself, and hence, the detail to be included in a model 
of a system of systems is determined by the modeler. Clearly, it also depends on 
the issues being addressed, the time and data available to address them, and the 
questions being asked and the decisions being considered by policymakers, which 
indeed can change over time. The umbrella phrase ‘systems approaches’ is used 
to describe a set of processes, methods, and practices that aim to define systems 
and improve their performance. Using systems approaches to address public sector
problems and issues can prove challenging for many reasons, one of which may
be limited institutional authority and capability, but applying them can
sometimes motivate changes in institutional missions and structures as well.


2.2.1 When to Use the Systems Approach 


It is reasonable to ask when it makes sense to consider using a systems
approach to address a public policy issue. What are the necessary conditions?
What unknown decision variables should be considered? In other words, what is 
to be decided? What is to be achieved? There are no common answers to these 
questions because each situation is different. However, in general, if the following 
conditions apply, the use of systems analysis methods within an institution may be 
beneficial. 


• An ‘innovative’ attitude and desire for improving the services provided by a
decision-making institution, whether local or national or international.
• The inclusion of stakeholders, the public, in decision-making is not only
possible but a priority.
• Satisfying stakeholder interests is an institutional goal.
• There is sufficient trust and capacity in government to think outside the box,
i.e., to experiment.
• Policy issues are complex enough to be difficult to address within disciplinary
or institutional silos.
• There exist one or more champions (persons or institutions) committed to
leading the study and able to implement change.
• There exist sufficient funding and time and data and expertise to perform the
analyses.


Policing, community recreational services, environmental protection, planning, 
forest, crop and water management, housing, infrastructure capacity expansion 
planning, waste disposal, and energy production and use are all domains in which 
systems approaches have been shown to be of value. In later chapters, you will have an
opportunity to model such systems. The common denominator is that these ser- 
vices directly interface with the needs and lives of people whose expectations and 
realities are changing in an environment of technological, economic, and global 
change. Successfully addressing an issue today does not mean it will not have to 
be addressed again at a later time. 


2.3 Data: Are There Ever Enough? 


To understand policy problems better, analysts require data. Models can be helpful 
in identifying just what data are needed to make decisions. Modeling can be help- 
ful in identifying not only the types or kinds of data but also their needed accuracy. 
Just how this is done will be illustrated in some of the following chapters of this 
book. For example, a model developed in Chap. 8 for finding the least-cost pollution
control policy can identify the least-cost decisions even without knowing
the precise costs of those decisions themselves. Hence, the common temptation to 
divide a systems study into two parts, the first being to collect all the available data, 
and the second part to think about how to use these data, should be replaced with 
a simultaneous coordinated modeling and data collection effort. Models can help 
identify what is needed, and data collection can identify what data are potentially 
available. 
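
To illustrate why precise cost data may not always be needed, consider the following minimal sketch in Python. It is not the Chap. 8 model. The three pollution sources, their removal limits, the required total removal, and all cost figures are hypothetical, and unit removal costs are assumed constant so that the least-cost plan can be found simply by using the cheapest sources first. Under those assumptions the decisions depend only on the ranking of the unit costs, so quite different cost estimates can lead to the same plan.

```python
def least_cost_removal(unit_costs, max_removal, required_total):
    """Minimize sum(c[i] * x[i]) subject to sum(x) >= required_total
    and 0 <= x[i] <= max_removal[i].

    With constant unit costs and a single total-removal requirement,
    filling the cheapest sources first gives a least-cost plan.
    """
    # Indices of the sources, ordered from cheapest to most expensive.
    order = sorted(range(len(unit_costs)), key=lambda i: unit_costs[i])
    removal = [0.0] * len(unit_costs)
    remaining = required_total
    for i in order:
        take = min(max_removal[i], remaining)
        removal[i] = take
        remaining -= take
        if remaining <= 0:
            break
    if remaining > 1e-9:
        raise ValueError("required removal exceeds total capacity")
    return removal


# Hypothetical data: removal limits at three sources and the total required.
max_removal = [40.0, 60.0, 50.0]
required_total = 90.0

# Three rather different (hypothetical) estimates of the unit removal costs.
# They disagree on the values but agree on the ranking of the sources.
cost_scenarios = {
    "low estimate": [2.0, 5.0, 3.0],
    "high estimate": [4.0, 12.0, 7.0],
    "rough guess": [1.0, 3.0, 2.0],
}

decisions = {
    name: least_cost_removal(costs, max_removal, required_total)
    for name, costs in cost_scenarios.items()
}

for name, plan in decisions.items():
    print(name, plan)
```

In this sketch all three cost scenarios yield the same removal plan, which suggests that effort spent refining the cost estimates would not change the decision. Collecting more precise cost data would matter only if it could change the ranking of the sources.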

Today, collecting ‘enough’ of even the needed data for some policy analysis 
studies may be too resource-intensive or even impossible. The sufficiency of
information is always an issue. In such situations, how can one proceed with confidence?
There is often no definitive answer. But it is worth remembering that the results 
of models are always based on assumptions. They address and provide answers to 
‘what if’ questions. This allows decision-makers to focus on what they think the 
best assumptions might be rather than on what is best given some assumptions. 


Appendix 
Some Case Study Summaries 


(a) Tackling domestic violence in Iceland. 


The Icelandic government has used systems analysis to develop and implement 
a program addressing violence against women. The program introduces a new 
integrated support system for victims based on the concept that domestic violence 
is a social (and not private) harm affecting everyone. Following research findings 
on domestic violence, and supported by new legislation, the program supports the 
victim and concentrates on stabilizing the family, rather than focusing on providers 
and authorities (lawyers, police, social services, etc.). Today, the police, social, 
and child protective services (and increasingly schools and healthcare providers) 
are working in a coordinated fashion to detect and respond effectively to domestic 
violence across Iceland (OECD, 2017). 


(b) Reshaping the child protection services in The Netherlands. 


CYPSA (Jeugdbescherming Regio Amsterdam) is a regional Dutch organization 
certified to provide Child and Youth Protection Services in the Amsterdam area. 
Since 2008, the organization has worked to redefine its purpose using a systems 
approach. As a result, the organization adopted a new mission for its activities 
entitled “Every Child Safe, Forever.” CYPSA redesigned its entire system to fulfill
that purpose and ensure it had a meaningful impact (OECD, 2017). 


(c) Regulating public transportation in Toronto, Canada.


Disruptive technological change and the emergence of the ride-sharing economy
are at the core of this case study. In Canada, digitalization impacts all levels of gov- 
ernment—city, province, and federal. Policies connected to emerging fields of the 
economy (e.g., housing and transportation, insurance, taxation, etc.) are regulated 
at different levels of government. This creates a problem—who has ownership over 
a governance issue? In 2014, the transportation network company Uber started to 
operate in Toronto without specific regulatory oversight. To tackle the regulatory 
challenge and simultaneously preserve the beneficial aspects of a sharing econ- 
omy, an independent arbiter using systems methods proposed a sharing economy 
strategy for Toronto (and by extension cities across Ontario). They also helped 
develop new legislation that enables the city and its citizens to both regulate and 
benefit from new entrants that disrupt old businesses (OECD, 2017). 


(d) Deciding how to share the Nile (Ethiopia). 


The continuing conflict in the Nile River Basin between Egypt, Sudan, and 
Ethiopia over the filling of Ethiopia’s newly built Grand Ethiopian Renaissance 
Dam is perhaps one of the best examples of an international ‘wicked’ water man- 
agement problem. So far, after a considerable number of modeling studies by just 
about every academic, consulting firm, NGO, and agency or research institution 
that models water, including modeling studies designed to check up on the results 
of other modeling studies, no acceptable solution is apparent. This is in spite of 
negotiations that continue to take place at the highest government, and even inter- 
national, levels. Downstream Egypt and Sudan do not want any increased risk of 
not having the water they consider they are entitled to, and upstream Ethiopia 
wants to fill the dam so as to maximize hydropower production to help meet the 
considerable demand for electrical energy in their country and in the surrounding 
region. Water that is stored in or evaporates from a reservoir is not then
available downstream, and that scientific fact is unacceptable to Egypt and Sudan.
All water allocation issues can turn into wicked ones that have no solutions when 
there is an unwillingness to compromise or think outside the box in order to enlarge 
the options for achieving an acceptable water management policy (El-Fekki & 
James, 2021). 


(e) Modeling ecosystems in the Great Lakes (Canada and US). 


A joint Canadian-US five-year 20-million-dollar systems study to identify 
improved operating policies for controlling the lake levels and river flows of the 
lower Great Lakes basin began over two decades ago. The study was undertaken 
by the International Joint Commission that oversees the management and oper- 
ation of all boundary waters between the two countries. The Great Lakes serve 
multiple purposes and users. These purposes include hydropower production, ship- 
ping, commercial fishing, recreational boating, shoreline protection, and ecosystem 
enhancement. Ecosystem enhancement is often in conflict with other goals, especially
shoreline preservation. Floodplain ecosystems benefit from some variation in
water levels and flows, whereas shoreline owners would prefer low constant levels 
that cause less erosion. Higher and more constant lake levels are preferred for other 
purposes if they are below flood stage. Furthermore, benefits derived from all the 
purposes but ecosystem enhancement can be expressed in monetary terms. But the 
main motivation for this study was to find operating policies that better protected, 
and in fact restored, wildlife habitat along the shores of the lakes and downstream 
river. At one point during this study, the US co-chair of the IJC requested a
benefit-cost analysis that included all the purposes served by the Lower Great Lakes
system, including ecological habitat restoration. He specifically wanted to know 
the dollar value of a muskrat since the main conflict was between what shore- 
line owners wanted and what ecologists assumed muskrats (representing wetland 
habitats) wanted. Without being able to justify a specific dollar value for ecosys- 
tem enhancement, the study ended after 5 years without that benefit-cost analysis 
and thus without a decision. The commissioner claimed later that not getting that 
analysis was one of the reasons no decision on a revised operating policy was 
made—until nine more years of further analyses and political debates (IJC, 2006). 


(f) Needing an interested client (Ghana). 


A few years ago, the African Development Bank funded a project exploring the 
possible reoperation of the Akosombo Dam on the Volta River. This hydroelec- 
tric dam in southeastern Ghana is operated by the Volta River Authority. Since 
the beginning of its operation in 1965, the dam’s discharges have degraded the 
downstream ecosystem of the river and its floodplains. This in turn has adversely 
impacted those living downstream of the dam. The aim of the project was to find an 
alternative operating policy that would restore the downstream ecosystems while 
still meeting electrical energy demands. The institution overseeing the project was 
the power authority. It had the authority to alter the dam’s operating policy, but pro- 
ducing power and generating electricity were their main missions and objectives. 
Here come these foreign scientists and modelers on relatively short visits to work 
with the authority and to help them obtain the data and develop the necessary 
models needed for establishing a reoperation policy and estimating its impacts. 
While spending considerable time with many of the impacted stakeholders as well 
as with the staff of the power authority during those visits to Ghana, the authority 
made it clear during each visit that ecosystem restoration was not their mission or 
interest. It might not have made any difference, but not being able to work closely 
and continuously with all involved in the project surely contributed to the failure 
of the modelers to gain the level of trust and understanding needed to enable a 
successful reservoir reoperation result (Opgrand et al., 2019). 


(g) Modeling the Great Man-made River (Libya).


The Great Man-made River in Libya is a system of wells, pumps, pipes, and
reservoirs designed to bring water from aquifers in the Sahara Desert to where 
water is needed along the Mediterranean Sea coast where most Libyans live and 
irrigate crops. Optimization models were used to identify cost-effective designs 
and operating and capacity expansion policies and to compare their costs to the 
costs of other alternatives for satisfying Libya’s water demands. Getting the data 
to enable that modeling proved to be a challenge. Individual government agencies
considered that the data they held gave them power and were not willing to give that
up. Only after some degree of trust was developed (on the squash courts) between
the foreign modelers and agency personnel did it become possible to obtain the 
needed data. 

As a footnote, during the planning and construction of the Great Man-made 
River, several engineers convinced the New York Times, a major and trusted news- 
paper in the US, that instead of being a water distribution system the project was 
really intended for transporting troops and tanks in trucks and trains to where 
Libya could invade its neighboring countries without being seen by satellites.
This ‘news’ was published on the front page of the New York Times, whose motto
is ‘All the News That’s Fit to Print’, on December 2, 1997. Indeed, it supported the
popular notion that Libya’s government was not to be trusted (Bonner, 1997). 


(h) Water and qat security (Yemen). 


Sana’a, the capital of Yemen, depends on an aquifer for its water. Years ago 
a groundwater modeling study showed that this aquifer would be depleted in a 
decade or two due to excessive withdrawals. Most of the groundwater withdrawals 
were being used for growing qat, a green-leaved plant that has been chewed by 
Yemenis for centuries for its stimulant effect. Asking Yemenis to restrict their 
chewing of qat would be similar to asking coffee drinkers to restrict their drink- 
ing of coffee. Finding a socially as well as economically acceptable solution to 
this water management problem proved to be difficult. When it was suggested to
policymakers that this issue be discussed in public, in hopes of enlisting the
public’s help and support in identifying a suitable solution, the policymakers
rejected the suggestion. “Why should we worry about this potential crisis? When
it happens, we may not even be alive.” 


(i) Restoring the Florida Everglades (United States). 


An example of having to adapt to unforeseen consequences involves the long-term 
project to restore the ecological health of the vast Everglades wetlands in the state 
of Florida in the US. Begun two decades ago, this project is arguably the most 
ambitious ecosystem recovery effort anywhere. It is in some sense in response 
to past management decisions that focused on development and did not consider 
preserving this unique environment. The project is essentially a vast re-plumbing 
scheme aimed at replicating as nearly as possible the historical freshwater flows 
over the flat wetlands of the Everglades, often called the River of Grass, that
once made South Florida a biological wonderland. These flows were diverted 
when in the late 1940s the US Army Corps of Engineers initiated a flood con- 
trol project aimed at protecting land for urban and agricultural development. Over 
a half-million acres were drained by a network of levees, canals, and pumping sta- 
tions. While making Florida’s eastern coast and midlands safer for development, 
it also destroyed much of the Everglades ecosystem including its wildlife. Now 
people care more about this unique ecosystem and the environment than they did 
when the decision was made to ‘drain the swamp’. The ongoing restoration project 
involves taking out much of that drainage and diversion infrastructure and restor- 
ing the overland flows to their original patterns to the extent possible to maintain 
what remains of this unique environment and ecosystem. The project is being 
informed by numerous simulation models and modelers from multiple federal and 
state agencies, each responsible for addressing a range of issues. The hope is that 
this unique ecosystem will continue to motivate people to visit (and spend their 
time and money in) Florida (Grunwald, 2006). 


(j) More water management modeling (Africa, Asia, Europe, and US). 


Successful examples of effective ongoing use of the systems approach to inform 
those managing water include the Mekong River Commission’s Decision Support 
Framework (Mekong DSF), the Nile Basin Initiative’s Decision Support System 
(NB DSS), the flood forecasting model, FloRiAn, of the International Commission
for the Protection of the Rhine (ICPR), and the Corps’ Water Management
System (CWMS) used by the U.S. Army Corps of Engineers to support its regu- 
lation of river flows through reservoirs, locks, and other water control structures 
located throughout the US. Other water allocation models are being used to inform 
managers of the Senegal and Zambezi Rivers in Africa and the Euphrates and 
Tigris Rivers in the Middle East, the South-North water diversion project in China, 
and in the operational management of Lake Como in Italy (FAO, 2021; Stakhiv 
et al., 2020; Todini, 2014). 


(k) Educating young modelers (US). 


When in the 1970s the Clean Water Act and its Amendments were passed in 
the US, they required all point sources of wastewater to be treated using ‘best 
management practices’ (that generally meant secondary treatment that removes 
about 80% of oxygen-demanding pollutants from wastewaters) before discharging 
them into receiving surface water bodies. The CWA policy became an expensive 
national public works program. Model studies showed that considerable money 
could be saved by adopting cost-effective policies, policies that met surface water 
quality standards at a minimum cost. In terms of infrastructure construction and 
operation costs, the CWA policy was expensive, but politically it was cheap. To 
enforce the CWA policy required monitoring only the quality of wastewater treatment
plant influents and effluents, an easier task than monitoring the quality of both
those wastewaters and the receiving surface water bodies. Modelers
who could identify more cost-effective wastewater treatment policies for particular 
watersheds and river basins did not have to defend their models, along with their 
assumed model parameter values, in court. Every potential polluter was treated 
equally. Investigations into which polluter upstream contributed to a water quality 
standard violation downstream, and by how much, were not necessary. Politically, 
the CWA policy was a much easier and less costly policy to implement. So much 
for the education of those advocating cost-effectiveness. 


(l) Food security (Algeria).


To become more self-sufficient in feeding its people, the government of Algeria 
initiated a study (in the 1970s) aimed at identifying the sites, design capacities, and 
operating policies of infrastructure needed to capture, store, and deliver irrigation 
water to parts of the Sahara Desert for growing crops. The system performance 
measures the government wanted considered were infrastructure installation and
operating costs and the amount and reliability of water delivered. The task of the 
modelers was to identify alternatives that represented efficient tradeoffs among 
these three conflicting objectives. When some results for one region of the country
were presented, the government chose an inferior solution, one that cost more, was less
reliable, and produced less water than many other possible solutions. When asked
why that plan was chosen, the answer was that their chosen plan satisfied other 
objectives better. This is an example of the fact that the set of project objectives 
and their relative importance can change during a modeling, planning, and policy- 
making process, especially as all involved learn more from the modeling and other 
sources about what is possible and hence what can be achieved. 


(m) Asking the right questions (Cambodia). 


In the Mekong basin, as in many other river basins in this world, hydroelectric dam 
builders are busy practicing their trade to meet increasing demands for energy. In 
one recent study, the question being addressed was where to site and how to design 
and operate a series of dams to produce hydroelectric power. Framing the question 
in this manner leads one to identify dam sites and hydropower plant capacities and 
reservoir operating policies needed to meet specified energy targets. Framing the 
question to be how to produce more energy leads to a broader range of options 
including the consideration of solar panels on existing reservoirs. In the Lower 
Mekong, solar power was shown to be a much less expensive option than building 
and operating more dams and less damaging to the ecosystems and biodiversity of 
the river. This information had an impact on a decision not to build a particular 
dam that was planned. For how long that decision will apply, who knows (Ratcliffe, 
2020; Thomas et al., 2017).


(n) Achieving cleaner air (Europe and India).


Where several different goals compete, modeling can help to find a balance. A 
highly successful example is the Regional Air Pollution Information and Sim- 
ulation Model (RAINS). In the 1990s, RAINS helped to guide Europe’s policy 
on six air pollutants, including particulates and sulfur dioxide (the chief cause of 
acid rain), calculating costs and health effects of various policies. RAINS results 
in Europe and India have shown the power of cooperative action on air pollu- 
tion, which is much more effective than efforts by any single state and, therefore, 
more politically attractive. Now extended to include greenhouse gases, the Green- 
house Gas and Air Pollution Interactions and Synergies (GAINS) Model reveals 
how clean-air policies can have co-benefits, improving the health of people and 
ecosystems while also curbing climate change (Battersby, 2021). 


Lessons from These Case Studies 


The application of systems studies of public policies is often triggered by a per- 
ceived crisis or opportunity. This may take the form of an actual crisis or a 
perception that the current performance of a public system could be better. All 
the case studies highlight that someone needs to have a vision and take direct 
ownership of the problem. All the case studies outlined above exhibited either 
some level of urgency or obvious opportunity to serve the public better that moti- 
vated the systems analyses. This in turn created a window of opportunity. Would 
the domestic violence project have developed if Iceland had not experienced a 
social or fiscal crisis? Would the modeling projects in the Nile, in Libya, and in 
Florida have taken place without some sense of urgency? Probably not. In short, 
the acknowledgment of cumulative severe effects can lead to a sense of urgency 
or crisis. However, the case studies from the Netherlands and Algeria and Yemen 
indicate that it is difficult to implement changes during truly chaotic moments in 
organizations, as some level of stability must be reached to initiate a broader sys- 
tems study. The stakeholders involved in such situations need to retain a sense of 
urgency, even in a stable environment. Maintaining the political will is an essential 
part of implementing change in more static conditions. Those at the highest level 
of an agency need to acknowledge that change is needed in the services they pro- 
vide. While achieving an agreement that there is a problem or opportunity is the 
first step, it is not enough. There has to be an agreement that something should be 
done to address the problem or take advantage of an opportunity. This agreement 
has to become actionable involving people and place. 

Once organizations recognize the need to change, they must invest the time to 
understand and articulate both the problems to be addressed and the objectives to 
be achieved. In the case of the Netherlands, this meant long internal discussions 
and the identification of a new mission: “Every Child Safe, Forever.” The organization
understood that they needed to focus on children’s safety and to start treating
adults as parents first and individuals second. In the case of Iceland, broader community
discussions with the police, social services, child protection, the church,
and so on were initiated. These reaffirmed the notion that domestic violence is 
a public health issue and not a private matter, thus prioritizing the social effects 
of violence over privacy. In Canada, the value debate made it clear that a more 
flexible, affordable transportation system was preferred over other concerns. In 
the case of Ghana, the responsible organization never considered a change to be in 
their interest, as indeed it might degrade the service they were currently providing. 

When implementing change, stakeholders may suggest many objectives or goals 
to be achieved. Some goals may conflict with others. This was the case in the 
Great Lakes, Algeria, Nile, Ghana, and Florida Everglades studies. In these cases, 
systems modeling was able to identify the tradeoffs among conflicting objectives 
or performance measures. Chapter 16 in this book is devoted specifically to how 
this can be done. 

Meaningful measurement, modeling, and monitoring are key to addressing and 
finding acceptable solutions to complex problems. Without them, causality and 
the effects of interventions are often difficult to assess. In the Netherlands, a spe- 
cific measure was used to evaluate child safety—‘acute child safety’. In Iceland, a 
new risk framework was adopted. In the Canadian case study, the whole process 
was initiated to produce a legitimate evaluation of the impetus for change. Con- 
sequently, modeling served as a communication tool used to justify the process 
of systems change and the use of systems approaches themselves. The evaluation 
carried out by the Institute for Gender, Equality, and Difference at the University 
of Iceland, regarding the domestic violence project, helped to keep the process 
going. In Toronto, the facilitator’s evaluation, alongside additional federal and non- 
governmental reports, paved the way for the city to advance the sharing agenda. 
Agencies involved in the restoration of the Florida Everglades are typically spend- 
ing over $50 million annually for modeling and monitoring and data management. 
They clearly believe if you cannot measure and monitor, you cannot manage 
needed change. 

A number of other factors emerged from these case studies. First, contextual 
factors impact systems change. Timing is important and supporting elements must 
come together to create a ‘window of opportunity’. Second, different resources are 
needed for systems change: time, finances, capabilities, and legitimacy, all of
which require leadership and sustained political support. 

However, leadership alone is not sufficient. Based on the case studies, it is dif- 
ficult to say which factors were the most influential, but it is clear that different 
elements have to be in place to make change possible. Moreover, systems change 
is a continuous process and it is essential to ensure feedback with regard to unin- 
tended consequences and unforeseen conditions during the implementation phase 
and beyond. Monitoring is critical to being able to decide if and when to adapt 
and make further changes. 

Modeling, as objective and value-free as it tries to be, cannot insulate itself 
from value judgments and decisions. Values enter the modeling process even in 
the framing of questions to address and objects of study, in decisions about what
gets funded, in the selection of data to be collected, and in the analytical methods
to be used and the scope of the analyses. Values also play a role in deciding 
what scientific evidence, including modeling results, are deemed appropriate to 
be communicated, and how they are to be presented. Just how effective modeling 
studies are in informing stakeholders and policymakers depends on just how much 
trust exists between them. Trust in modeling increases if modelers are engaged 
and open with the people they want to inform and influence. 


Exercises 


1. Under what conditions might it be appropriate to apply systems modeling methods 
to public sector issues? 

2. What is the purpose of developing and using modeling methods? 

3. What is a measure of modeling success?


Creating Models


ABSTRACT 


This chapter introduces the approach to developing optimization models for identifying
and evaluating alternative designs of some infrastructure. The chapter distin- 
guishes between two general types of models, optimization and simulation, and 
continues the discussion of the advantages and possible pitfalls of modeling. 


System analyses involve modeling. The only way I know how to become good 
at model development and use is to practice. Opportunities to practice are given 
throughout the remaining chapters of this book starting in this chapter. 


3.1 Let’s Model 


To develop mathematical models, we need to use some notation for defining sys- 
tems and their inputs, outputs, and various measures or indicators of performance. 
This chapter uses some simple examples to illustrate the modeling process and 
some common notations modelers use. 

Many models consist of equations and inequalities that contain variables 
whose values are unknown and parameters whose values are assumed known. 
Together they define the system components and their interactions, and the system 
performance measures. 

For example, consider creating a local community park having a specified area, 
A, that is to be surrounded by a fence. The perimeter of the park, P, i.e., the total 
length of fencing, is to be determined (Fig. 3.1). 

Depending on the area’s dimensions, which we don’t know, there are many 
possible values of P for a fixed area A. 

Consider a rectangular area as illustrated in Fig. 3.2. If the area is rectangular 
with length L and width W, then the area A is LW and the total length of fencing 
P is 2L + 2W.

Fig. 3.1 A park area surrounded by a fence


Fig. 3.2 A rectangular area having length L and width W


There are many combinations of L and W that can enclose a specified land area A. 
If we want to find the minimum value of P needed to enclose the specified area, 
A, and it is not already obvious, we can develop an optimization model of this 
system. Optimization models have an objective function that is to be maximized or 
minimized and various constraints that define the relationships among the system 
variables and parameters. In other words, they define the system. In this case, the 
objective is to find the minimum value of the length of fencing P that encloses the 
known area A. Hence, the model’s objective function is as follows: 


Minimize $P$ (the length of the fence needed)

subject to constraints that define $P$ in terms of the dimensions of the area:

$P \ge 2(L + W)$   The length of the fence must at least surround the area.

$A \le LW$   The area must be no less than some specified (known) $A$.

$P \ge 0$   The total length of the fence is non-negative.

$L \ge 0,\ W \ge 0$   The variables $L$ and $W$ are non-negative.


Fig. 3.3 A circular area having radius R and circumference P


The objective function and all the inequality constraints just listed make up a 
model of this rectangular park. The variables P, L, and W are the unknown decision 
variables. The known area A is a parameter, along with the number 2. 
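
The rectangular park model can also be explored numerically. The following is only a minimal sketch, assuming the area constraint holds with equality (so W = A/L) and using a plain grid search instead of a formal optimization solver; the function name and grid settings are illustrative assumptions, not part of the model itself.

```python
import math

def min_fence_rectangle(area, steps=100_000):
    """Grid search for the rectangle of a given area with the shortest fence.

    Assumes the area constraint is tight, so W = area / L, and tries
    `steps` evenly spaced candidate lengths L in (0, 4 * sqrt(area)].
    """
    best_length, best_perimeter = None, math.inf
    upper = 4 * math.sqrt(area)
    for k in range(1, steps + 1):
        length = upper * k / steps
        width = area / length            # A = L * W
        perimeter = 2 * (length + width)  # P = 2(L + W)
        if perimeter < best_perimeter:
            best_length, best_perimeter = length, perimeter
    return best_length, area / best_length, best_perimeter

if __name__ == "__main__":
    L, W, P = min_fence_rectangle(100.0)
    print(f"L = {L:.3f}, W = {W:.3f}, P = {P:.3f}")
```

For an area of 100, the search should return a length and width both close to 10, i.e., a square.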

If the area of the park is a circle, the radius, R, of the circle is unknown but is 
constrained by A (Fig. 3.3).


$A \le \pi R^2$


The value of $\pi$ is a known parameter, approximately 22/7.
The needed fencing must at least surround the circular park.


$P \ge 2\pi R$ and $R \ge 0$, $P \ge 0$


The two unknown variables are non-negative.

Now obviously the solution to the circle problem is $P = 2\pi R$ and $R = \sqrt{A/\pi}$
so we don’t need an optimization model to find the minimum value of P. But in 
the case of a rectangle, it may not be obvious what the values of the L and W 
are that minimize P given A. But even here a little thought will convince anyone 
that L will equal W and thus each will equal the square root of A, $\sqrt{A}$. But if the
fence had to be of different types for the four different sides, each costing different 
amounts per unit length, and the objective was to minimize total cost, the solution 
would not be so obvious. 
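
To see why unequal fence costs make the answer less obvious, here is a hedged sketch of that variant. It assumes the two sides of length L and the two sides of width W each have their own unit cost, that the area constraint is tight, and that the best length lies within ten times the square root of the area; all names and numbers are hypothetical.

```python
import math

def cheapest_fence(area, unit_costs, steps=200_000):
    """Grid search for the least-cost rectangular fence around a given area.

    unit_costs = (c_L1, c_L2, c_W1, c_W2): cost per unit length of the two
    sides of length L and the two sides of width W (illustrative values).
    Assumes W = area / L and that the best L is at most 10 * sqrt(area).
    """
    c_l1, c_l2, c_w1, c_w2 = unit_costs
    upper = 10 * math.sqrt(area)
    best_cost, best_length = math.inf, None
    for k in range(1, steps + 1):
        length = upper * k / steps
        width = area / length
        cost = (c_l1 + c_l2) * length + (c_w1 + c_w2) * width
        if cost < best_cost:
            best_cost, best_length = cost, length
    return best_length, area / best_length, best_cost

if __name__ == "__main__":
    # Width sides cost four times as much per unit length as length sides.
    print(cheapest_fence(100.0, (1.0, 1.0, 4.0, 4.0)))
```

With these assumed costs the cheapest park is no longer a square: the search lengthens the cheap sides and shortens the expensive ones.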

Before leaving this park problem, an equivalent modeling approach is to maxi- 
mize the area, A, of the park given a fixed known length of fencing, P, available. 
Its solution will be the same as the solution to the previously defined models if 
the input parameter values are the same. 
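
The equivalent maximize-area formulation can be sketched in the same way. This is an illustrative grid search only, assuming all of the available fencing is used (so W = P/2 − L); the function name is hypothetical.

```python
def max_area_rectangle(perimeter, steps=100_000):
    """Grid search for the rectangle of largest area for a fixed fence length.

    Assumes all fencing is used, so L + W = perimeter / 2.
    """
    half = perimeter / 2.0
    best_length, best_area = None, -1.0
    for k in range(1, steps):
        length = half * k / steps
        width = half - length
        area = length * width
        if area > best_area:
            best_length, best_area = length, area
    return best_length, half - best_length, best_area

if __name__ == "__main__":
    L, W, A = max_area_rectangle(40.0)
    print(f"L = {L:.3f}, W = {W:.3f}, A = {A:.3f}")
```

With 40 units of fencing the search should again find a square of side close to 10 enclosing an area close to 100, mirroring the minimum-fence model for an area of 100.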

In the real world, this community park fencing problem may be a little more 
complex in that neither a rectangle nor a circle is desired or possible. Also of 
possible interest may be the gain in fencing that may be required for a unit gain in 
the area. One can determine these values by changing the parameter value A and 
re-solving the model.

Fig. 3.4 Dimensions of a circular and rectangular tank


These simple examples serve to illustrate what modeling may look like and 
some of the notation used in defining models. Models consist of mathematical 
expressions that define the objectives or system performance measures as well as 
the constraints that specify conditions that have to be satisfied while minimizing or 
maximizing an objective function. The mathematical expressions contain decision 
variables whose values we seek and parameters whose values we assume we know. 

The models just developed involving areas and their perimeters can be extended 
to consider three dimensions, i.e., volumes, rather than just two. Referring to the 
tanks shown in Fig. 3.4, there are many possible combinations of values of their 
dimensions that will satisfy any specified required volume, V. The best combina- 
tion of values for the dimensions will depend on the design objective. One possible 
objective might be to minimize the area of material used for the tank’s sides, its 
base, and top. Another may be to minimize the total cost of the tank’s surfaces, 
where the costs per unit area of each surface can differ. Models can be developed 
that when solved will identify the values of l, w, and h of a rectangular tank, or R 
and H of a circular tank, as shown in Fig. 3.4, that achieve some objective, while 
meeting a volume V constraint. 

There are many ways one can model this design problem. Different people may 
create different models, all of which when solved will yield the same solution if 
the assumed objective and parameter values are the same. Modeling is an art, and 
different artists rarely paint the same scene in the same way. But all models consist 
of equations and inequalities and each term within each equation or inequality has 
the same units of measure. 

Assume the goal of a community public works department is to increase the 
reliability of the community water supply. They can do this by building a water 
storage tank. The greater the tank capacity, the greater will be the water supply 
reliability. But the greater the tank’s capacity, the greater its cost. Assume the 
community doesn’t want to spend more money than it has to but it has not decided 
what that amount should be. To help them make such a decision, they would like 
to know the relationship between cost and tank volume. Obviously, for a specified 
volume, there could be many costs depending on the tank’s dimensions. Hence, 
what is desired is the function defining the minimum cost associated with any 
specified tank volume. In other words, it wants to know the tradeoff between tank 
volume capacity and its minimum total cost. This tradeoff can be defined using an 
optimization model.

Fig. 3.5 Minimum cost function derived from the solution of the minimum cost model for various values of volume V (axes: Total Cost and Volume V)


What costs money are the surfaces of the tank. For a rectangular tank, these 
surface areas are defined by the tank’s length l, width w, and height h. The cost
per unit surface area may depend on the particular surface area, whether it is the 
tank’s bottom, sides, or top. 

The rectangular tank’s capacity or volume, V, is the product of its length, width, 
and height, lwh. 

To minimize the tank’s total cost, we are minimizing the cost of the sides having
a total area of 2(wh + lh) and the top and bottom each having an area of lw.
Multiplying the unit cost (cost per unit area) of each surface area ($C_{SIDE}$, $C_{TOP}$,
and $C_{BASE}$) times the area defines the total cost of that surface area. Adding these
total surface costs gives us the total tank cost.

The minimum cost model can be written as follows: 


Minimize Total_Cost = $C_{SIDE} \cdot 2h(l + w) + (C_{TOP} + C_{BASE})(lw)$
Subject to: $lwh \ge V$.


Solving this model for various values of the volume, V, will define the min- 
imum cost function for storage volume, as illustrated in Fig. 3.5. Knowing the 
minimum (and marginal) cost associated with any particular volume should be 
useful information to those having to decide what the tank’s capacity should be. 
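
One way to trace out such a minimum cost function is sketched here. It is not a recommended solution method, just a coarse grid search that assumes the volume constraint is tight (h = V/(lw)) and that the best length and width are no more than three times the cube root of V; the unit costs and function name are hypothetical.

```python
def min_tank_cost(volume, c_side, c_top, c_base, steps=300):
    """Coarse grid search for the cheapest rectangular tank of a given volume.

    Assumes h = volume / (l * w) and searches l and w over
    (0, 3 * volume ** (1/3)]. Returns (cost, l, w, h).
    """
    upper = 3.0 * volume ** (1.0 / 3.0)
    best = (float("inf"), None, None, None)
    for i in range(1, steps + 1):
        l = upper * i / steps
        for j in range(1, steps + 1):
            w = upper * j / steps
            h = volume / (l * w)
            # Total_Cost = C_SIDE * 2h(l + w) + (C_TOP + C_BASE) * l * w
            cost = c_side * 2 * h * (l + w) + (c_top + c_base) * l * w
            if cost < best[0]:
                best = (cost, l, w, h)
    return best

if __name__ == "__main__":
    # Re-solve for several volumes to trace the minimum cost function.
    for v in (1.0, 8.0, 27.0):
        cost, l, w, h = min_tank_cost(v, 1.0, 1.0, 1.0)
        print(f"V = {v:5.1f}  min cost = {cost:7.2f}  l, w, h = {l:.2f}, {w:.2f}, {h:.2f}")
```

Plotting the resulting minimum costs against the volumes gives a curve of the kind shown in Fig. 3.5.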

In this example as with the others, there are many possible feasible solutions, 
i.e., solutions that satisfy the constraints. We identify and use an objective to determine
the best value of all the unknown decision variables (in this case l, w, and
h) associated with that objective. Different objectives may result in different ‘best’ 
solutions for various volumes V. 

Before leaving this example problem, it is worth mentioning that there is often 
more than one way to view an optimization problem. For example, this prob- 
lem could be viewed as finding the maximum volume V that can be obtained 
given a budget constraint, i.e., the money available to spend on the surfaces of 
the tank. The variable ‘Total_Cost’ in the above model is now known, and the 
objective becomes Maximize V. Nothing else changes. Again, for various values 
of Total_Cost, the model solution will identify the maximum volume that can be 
obtained and its associated dimensions l, w, and h.
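
The budget-constrained view can be sketched in the same spirit. The code assumes the whole budget is spent (whatever remains after paying for the top and base goes to the sides) and uses a coarse grid search; the search range and the function name are illustrative assumptions.

```python
import math

def max_tank_volume(budget, c_side, c_top, c_base, steps=300):
    """Coarse grid search for the largest rectangular tank within a budget.

    For each trial l and w, the money left after paying for the top and
    base buys as much side height h as possible. Returns (volume, l, w, h).
    """
    c_top_base = c_top + c_base
    upper = 2.0 * math.sqrt(budget / c_top_base)  # assumed search range
    best = (0.0, None, None, None)
    for i in range(1, steps + 1):
        l = upper * i / steps
        for j in range(1, steps + 1):
            w = upper * j / steps
            left_for_sides = budget - c_top_base * l * w
            if left_for_sides <= 0:
                continue  # top and base alone exceed the budget
            h = left_for_sides / (c_side * 2 * (l + w))
            volume = l * w * h
            if volume > best[0]:
                best = (volume, l, w, h)
    return best

if __name__ == "__main__":
    print(max_tank_volume(6.0, 1.0, 1.0, 1.0))
```

With equal unit costs of 1 and a budget of 6, the result should be close to a tank with l, w, and h all near 1 and a volume near 1.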

Clearly, the values of the cost per unit area parameters, $C_{SIDE}$, $C_{TOP}$, and $C_{BASE}$,
will influence the resulting values of l, w, and h. If these unit costs are all the same,
then we are finding the minimum total surface area associated with any volume V.
In this case, the tank becomes a cube where $l = w = h = \sqrt[3]{V}$.


3.2 Types of Models


The examples just discussed involve finding the ‘optimal’ values of all the 
unknown decision variables of a particular ‘system’. Optimization models are used 
to find those decision variable values that maximize or minimize some function 
that represents some system performance goal or objective. Examples are the max- 
imization of net economic benefits; the minimization of costs; the maximization 
of equity; the minimization of risks of various types; the maximization of mea- 
sures of ecological, environmental, or human health; and so on. There are many 
different types of optimization models. The following chapters introduce some of 
them. They all have their advantages and limitations, and there is no one optimiza- 
tion method that is best for all optimization problems. But all optimization models 
focus on addressing ‘what should be’ the values of all the unknown decision vari- 
ables given all the assumed parameter values, constraints, and system performance 
goals. 

As opposed to optimization, simulation models focus on addressing ‘what if’. 
What will be the performance of the system given assumed values of all param- 
eters and decision variables? In these models, the values of all decision variables 
are specified, and the model output indicates how the system performs given the 
various inputs and decision variable values. 
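
The distinction can be made concrete with a small sketch based on the tank example. Here the simulation function only evaluates a fully specified design, while the optimization function searches over candidate designs by calling the simulation repeatedly. The unit costs, candidate sizes, and function names are hypothetical, and exhaustive enumeration stands in for a real optimization method.

```python
from itertools import product

def simulate_tank(l, w, h, c_side=1.0, c_top=1.0, c_base=1.0):
    """Simulation ('what if'): all decisions are given; report performance."""
    volume = l * w * h
    cost = c_side * 2 * h * (l + w) + (c_top + c_base) * l * w
    return volume, cost

def optimize_tank(required_volume, candidate_sizes):
    """Optimization ('what should be'): find the cheapest candidate design
    that provides at least the required volume."""
    best_design, best_cost = None, float("inf")
    for l, w, h in product(candidate_sizes, repeat=3):
        volume, cost = simulate_tank(l, w, h)
        if volume >= required_volume and cost < best_cost:
            best_design, best_cost = (l, w, h), cost
    return best_design, best_cost

if __name__ == "__main__":
    print(simulate_tank(1, 2, 4))           # performance of one given design
    print(optimize_tank(8, [1, 2, 3, 4]))   # best design among the candidates
```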

The difference between optimization and simulation is illustrated in Fig. 3.6. 


Fig. 3.6 a Schematic of optimization modeling where the optimal decision variable values of a
system are determined based on an assumed performance goal. b Schematic of simulation modeling
where decision variable values of a system are specified, and the performance of the system is
to be evaluated


3.3 Why Model? 


The reason we develop and solve models of systems is to better understand how 
to improve their performance and to estimate the impacts of doing so. 

In both public and private sectors, there are often certain systems that may not 
be functioning as well as expected or desired, or there may be opportunities for 
modifying existing systems or building new ones that would increase social wel- 
fare or economic benefits or environmental quality or better satisfy some other 
system performance objective or goal. When there are many possible decisions or 
actions that could be taken and the best set of decisions or actions is not obvious, 
it often makes sense to use models to identify what decisions may have better out- 
comes than others. Solving models is one way of estimating the various impacts 
resulting from various decisions. We build and solve models to get useful informa- 
tion. We use models to aid us in identifying and evaluating alternative decisions 
in our search for the best. 

Public policy modeling involves the use of tools taken from the disciplines of 
economics, planning, political science, operations research, statistics and probability
theory, and applied mathematics. Depending on the issue or system being analyzed,
it will also draw on agriculture, ecology, environmental management and policy,
transportation engineering, law, and other disciplines as needed.

We often deal with systems that are so complex as to be beyond the limits of 
our intuitive understanding. If it is not obvious what decisions to take that will 
maximize system performance, then by definition, the system is complex. In these 
cases, we can construct models to help us study that which we seek to understand 
better. 

Whether a model is right or wrong or too simplistic or too complex is simply 
a value judgment. Whether it is correct or incorrect, or a good model or bad 
model, depends on how well it serves its purpose, given the information needed 
and the time and data available. The most important question to ask is how well it 
promotes our understanding of how to improve the design and/or management or 
operation of a system and the resulting impacts. The extent to which a model aids 
in the development of our understanding is the basis for deciding how good the 
model is. Many find that just the process of building models gives them a greater 
understanding of the system they are modeling even before attempting to solve 
them. 

When developing models there is always a tradeoff between reality and sim- 
plicity. A model is inevitably a simplification of reality. The question is always 
what to include and what to exclude. If relevant components are excluded, there is 
a chance that the model will be too simple to be useful. On the other hand, if too 
much detail is included, the model may become so complicated that, again, it fails 
to promote the stakeholder trust needed to fully accept its output. A recommended 
approach to model building is to start simple and add detail only as needed and 
after successfully solving the simpler model.


3.3.1 Some Caveats


Our job as modelers is to construct models with sufficient detail to provide 
decision-makers with the understanding and precision they need or want about 
the system or process of interest and for which decisions will be made. They may 
want to know the following: 


What to do. 

Where to do it. 

How much to do—to what extent. 

When to do it. 

Why—what are the economic, social and environmental, or other impacts? 


These questions should be answered at the level of detail, and in terms, appropriate 
for the level of decision-making and issues being addressed. 

Modeling can help address these questions, but its answers will be based on a given
set of assumptions. What are the best assumptions? Models can be helpful in determining
the best decisions given the assumptions and the objective(s), but not in
identifying what assumptions are best, or correct, or true. Modeling can, therefore,
help focus the political debate on just what assumptions are best rather than
spending time determining what decisions are best given any assumptions.

This suggests that a modeler’s job is not over until a ‘sensitivity analysis’ is 
performed. In a sensitivity analysis, the assumptions should be varied over their 
likely values to determine just how sensitive the model’s decision variable values 
are to changes in the assumptions. If, as one hopes, the changes in those decision 
variable values are not significant, there may be less need to spend a lot of time 
debating the assumptions. Otherwise, there may be a greater need to find a robust 
set of decision values that will ensure satisfactory system performance no matter 
what assumptions turn out to be true. 
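
A sensitivity analysis of this kind can be sketched as a loop that re-solves a model for each assumed parameter value. The example is purely illustrative: it assumes a square-based tank (side x, height h), a hypothetical range of side-wall unit costs, and a simple one-dimensional grid search as the solution method.

```python
def solve_square_tank(volume, c_side, c_top_base, steps=20_000):
    """Grid search for the cheapest square-based tank of a given volume.

    x is the side of the square base and h = volume / x**2.
    c_top_base is the combined unit cost of the top and the base.
    Returns (x, h, cost).
    """
    upper = 5.0 * volume ** (1.0 / 3.0)  # assumed search range for x
    best_cost, best_x = float("inf"), None
    for k in range(1, steps + 1):
        x = upper * k / steps
        h = volume / x**2
        cost = c_side * 4 * x * h + c_top_base * x**2
        if cost < best_cost:
            best_cost, best_x = cost, x
    return best_x, volume / best_x**2, best_cost

def sensitivity(volume, c_side_values, c_top_base):
    """Re-solve the model for each assumed side cost and collect the decisions."""
    return {c: solve_square_tank(volume, c, c_top_base) for c in c_side_values}

if __name__ == "__main__":
    for c_side, (x, h, cost) in sensitivity(8.0, [0.5, 1.0, 2.0], 2.0).items():
        print(f"c_side = {c_side}: x = {x:.2f}, h = {h:.2f}, cost = {cost:.2f}")
```

If the reported dimensions hardly change across the assumed costs, the cost assumption needs little debate; if they change a lot, as they do in this hypothetical case, the assumption deserves closer attention.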


3.3.2 Limitations and Common Sins 


• Models cannot help us invent new ideas or creative alternatives that are not
considered in our models. For example, a model for determining the most economical
dimensions of a rectangular tank will not suggest a circular tank may
be better.

• Modeling can be seductive: there is a danger that modelers or users of models
come to believe the model is the real world.

• Incorrect, ambiguous, or erroneous model inputs result in errors in model outputs.
For example, what does 8/2(2 + 2) equal? One or sixteen? Different calculators
may give different answers.

• Difficulty in verifying uncertain (future) data and assumptions.

• Insufficient attention to the sensitivity of assumptions and uncertainty analyses.

• Temptation to shape model results to what the client (or teacher?) wants to hear.
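
The ambiguity in 8/2(2 + 2) disappears once the intended grouping is written out explicitly. A short Python illustration (the variable names are only for this example):

```python
# Python has no implied multiplication, so the modeler must state the
# intended grouping explicitly; the two readings give different values.
left_to_right = 8 / 2 * (2 + 2)          # read as (8 / 2) * (2 + 2)
grouped_denominator = 8 / (2 * (2 + 2))  # read as 8 / (2 * (2 + 2))

print(left_to_right)        # 16.0
print(grouped_denominator)  # 1.0
```

Writing model expressions with explicit parentheses is a cheap way to avoid this kind of input error.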




Fig. 3.7 We all have mental models, and we should not ignore them when evaluating
our mathematical ones


3.3.3 A Word of Caution 


For anyone learning how to develop and solve various types of mathematical mod- 
els to address various problems and issues, it is easy to become enamored with 
the potential power of this methodology for identifying and evaluating alternatives, 
and indeed for finding mathematically optimal solutions. This especially applies 
to those who enjoy the subject and enjoy solving puzzles. They tend to trust their 
models. But when a computer program says an optimal solution is found, one 
should look at it and ask, does the solution make sense? Are the results surpris- 
ing? If so, there may be a good chance that there is an error in the model or its 
input. If you cannot find one, then maybe you should do all the tests and sensi- 
tivity analyses you can think of to be sure you have actually created some new 
knowledge or understanding. If that is the case, then brag about it! But more to 
the point, we all have mental models of what may be the best decision, at least 
generally if not in its details. These mental models may be influenced by factors 
not included in the mathematical ones. Hence, do not ignore your mental models
or those of others, including those illustrated in Fig. 3.7.


3.3.4 Subscripted Variables 


When constructing models, it is often convenient to use subscripts or super- 
scripts to distinguish among different variables. For example, consider allocating a 
resource to n different activities. Let the subscript i represent a particular activity. 
Then $X_i$ can represent the allocation to the ith activity. If R is the total amount of
resources available, then an obvious constraint on all the allocations is that their 
sum cannot exceed R. 


$X_1 + X_2 + \cdots + X_i + \cdots + X_n \le R.$

This can also be written using the summation sign $\sum$.


$\sum_{i=1,n} X_i \le R$ or $\sum_{i=1}^{n} X_i \le R.$


If for some reason you wanted to know the product of all the $X_i$ variables, it
could be written using the product sign $\prod$.


$X_1 X_2 X_3 \cdots X_i \cdots X_n = \prod_i X_i.$


Assuming each of these allocations must be non-negative, then

$X_i \ge 0, \quad i = 1, 2, \ldots, n$

or if n is understood you can use the ‘for all’ sign $\forall$.

$X_i \ge 0, \quad \forall i$


It doesn’t matter what letters are used for subscripts or superscripts as long as 
what they signify are defined. 

For example, if the subscript i denotes a location and the subscript j a particular 
product, and if $X_{ij}$ is the number of products of type j sent to location i, then


$\sum_j X_{ij}$ = total number of all products j sent to location i,

$\sum_i X_{ij}$ = total number of product j sent to all locations i,

$\sum_i \sum_j X_{ij}$ = total number of all products sent to all locations,



where it is assumed understood how many locations i and how many different 
products j exist and each sum includes all the values of the associated subscript. 

There will be other symbols we will be using, some of which are shown in 
Table 3.1. We will define others when we need them. 
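
For readers who find it easier to read code than summation signs, the same operations can be written in a few lines of Python. The data here are made up for illustration, with X[i][j] standing for the number of products of type j sent to location i, and R an assumed limit on the total.

```python
import math

# Hypothetical data: X[i][j] = number of products of type j sent to location i.
X = [
    [3, 1, 2],  # location 0
    [1, 4, 0],  # location 1
]
R = 12  # assumed upper limit on the total sent

sent_to_location = [sum(row) for row in X]       # sum over j, one value per i
sent_of_product = [sum(col) for col in zip(*X)]  # sum over i, one value per j
total_sent = sum(sum(row) for row in X)          # double sum over i and j
within_limit = total_sent <= R                   # the sum cannot exceed R
all_non_negative = all(x >= 0 for row in X for x in row)  # X_ij >= 0 for all i, j
product_first_row = math.prod(X[0])              # product sign applied to one row

# Double sum of (i + j) for i = 0..2 and j = 0..i, as in the Table 3.1 example.
double_sum = sum(i + j for i in range(3) for j in range(i + 1))

print(sent_to_location, sent_of_product, total_sent, within_limit)
print(all_non_negative, product_first_row, double_sum)
```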


Exercises 


1. If $\sum_{i=2}^{4} A(i) = A(2) + A(3) + A(4)$, write out the sum: $\sum_{i=1}^{3} \sum_{0 \le j \le i} (X_{ij})$.
2. Given that $\sum$ represents a sum and $\prod$ represents a product of n terms, what
is the value of Dp; J- @ + J)/Dhok =?




Table 3.1 Some modeling operations and notations (The use of the constant e will be discussed later.)

Symbol | Name | Definition | Example
$\Delta$ | Delta | Change, difference | $\Delta t = t_2 - t_1$
$\sum$ | Sigma | Sum | $\sum_{i=1}^{n} i = 1 + 2 + 3 + 4 + \cdots + n$
$\sum\sum$ | Double sigma | Double sum | $\sum_{i=0}^{2} \sum_{j=0}^{i} (i + j) = (0+0) + (1+0+1+1) + (2+0+2+1+2+2)$
$\prod$ | Capital pi | Product | $\prod_{i=1}^{n} A_i = A_1 A_2 \cdots A_n$
$\forall$ | For all | Applies to all values of an index | Assuming n values of j: $\forall j$ replaces j = 1, 2, 3, ..., n

3. Construct a conceptual model (a picture or a node-link network) of a multiple
component system. Then identify what decisions are to be made and potential
objectives or measures of performance.

4. Define the ‘modeling process’ in your own words.

5. What are the possible sources of uncertainty in any planning or management
model and how can one deal with them?

6. Distinguish between simulation and optimization.

7. Identify some pitfalls of modeling.

8. Consider the following five alternative plans for providing for more security
and better road maintenance. Whatever the units of performance are, they differ.
Assume the alternative plans are all feasible, i.e., can be implemented but
only one is to be selected.

Alternative | Security benefits | Road maintenance costs
A | 25 | 30
B | 10 | 32
C | 20 | 35
D | 15 | 21
E | 5 | 25

Which alternative would be the best in your opinion and why? Why might a
decision-maker select alternative E even realizing other alternatives exist that can
give more security and road maintenance?

9. Define a mathematical model for finding the dimensions of a cylindrical tank that
minimizes the total cost of storing a specified volume of liquid. What are the
unknown decision variables? What are the model parameters? How would you
solve this model?
Modeling Examples and Solutions


ABSTRACT 


This chapter illustrates the development of optimization models for various 
example problems and introduces the hill-climbing approach for solving them, 
as and if appropriate. 


4.1 Introduction 


In this short chapter, the problem of allocating scarce resources to multiple users 
will be introduced and modeled. The so-called hill climbing method will be used 
to find the allocations that best satisfy some objective. Later chapters will intro- 
duce other methods of solving this allocation problem. The purpose here is not 
to emphasize resource allocation issues but to use that problem as an example to 
illustrate the model building and solution process. 


4.2 Resource Allocation 


Consider the common problem of supplying multiple agencies with the
resources they need to function when there are not enough to meet their requested
allocations. In this case, assume there are three such agencies and R units of the
resource available, as illustrated in Fig. 4.1. Let each variable $A_i$ be the unknown
allocation to user i (i = 1, 2, 3). For any non-zero value of R, it is clear there are
many possible combinations of allocations that could be made. The problem is to
find the best values of the allocations, $A_i$.

There are various criteria one could use to identify just what allocations are
best. If the benefits, $B_i(A_i)$, associated with each allocation $A_i$ can be identified,
then one criterion could be to maximize the total benefits, TB, obtained from all
three allocations and then determine what fraction, $f_i$, of those total benefits should
be allocated to each use in some equitable way. Some economists liken this to
maximizing the size of the economic pie (Fig. 4.2). A larger pie provides more benefits
to distribute. This redistribution approach assumes the existence of some
institutional arrangement that could implement such a policy.

Fig. 4.1 Schematic of a resource allocation problem involving R units of a
resource and three potential users of those resources. Each $A_i$ is the allocation to user i

Fig. 4.2 Determining how to divide and distribute the ‘economic pie’ is a political
decision. Center for Economic Policy and Research. Creative Commons
Attribution 4.0 International License https://www.cepr.net/ceprs-greatest-hits-volume-one/

A model of this problem is as follows: 


Maximize $TB = \sum_i B_i(A_i)$   (total benefits)

Subject to the following:
$\sum_i A_i \le R$   Total allocation cannot exceed the resources available
$A_i \ge 0$   Non-negative allocations for i = 1, 2, 3.


If the total benefits are to be redistributed, then the portion of the total
benefits, $TB_i$, allocated to use i will be some fraction, $f_i$, of the total benefits, TB.
Determining the best values of the fractions $f_i$ is a political issue.

$TB_i = f_i \, TB \quad \forall i$ and $\sum_i f_i = 1$.


Other possible criteria include the following: 


• Minimize the sum of differences, or differences squared, between what each
user wants, $D_i$, and what they get, expressed in units of $A_i$ or $TB_i$. (Minimizing
the squares of the differences will tend to equalize them.)

• Minimize the maximum difference between what each user wants and what
they get, $A_i$ or $TB_i$.

• Minimize the sum of percent differences between what each user wants and
what they get, $A_i$ or $TB_i$.

• Minimize the maximum percent difference between what each user wants and
what they get, $A_i$ or $TB_i$.


What each user wants or expects, $D_i$, is often called a target. Deviations from
targets usually result in economic or other types of losses. In situations where the 
targets themselves are unknown and to be determined, the objective or criterion 
could be the maximization of the sum over time of benefits associated with target 
values less the losses associated with allocations that are less than the targets. Such 
models will be discussed in more detail in later chapters of this book. 
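The alternative criteria listed in this section can be written as small functions of the targets and the allocations. This is only an illustrative sketch: the function names and the example numbers are assumptions, and absolute differences are used so that surpluses and shortfalls do not cancel.

```python
def sum_of_squared_differences(targets, allocations):
    """Sum over users of (target - allocation) squared."""
    return sum((d - a) ** 2 for d, a in zip(targets, allocations))

def max_difference(targets, allocations):
    """Largest absolute gap between a target and its allocation."""
    return max(abs(d - a) for d, a in zip(targets, allocations))

def sum_of_percent_differences(targets, allocations):
    """Sum over users of the gap as a percent of the target (targets must be non-zero)."""
    return sum(100.0 * abs(d - a) / d for d, a in zip(targets, allocations))

def max_percent_difference(targets, allocations):
    """Largest gap as a percent of the target (targets must be non-zero)."""
    return max(100.0 * abs(d - a) / d for d, a in zip(targets, allocations))

# Hypothetical targets D_i and allocations A_i for three users.
D = [4.0, 2.0, 6.0]
A = [3.0, 2.0, 3.0]

print(sum_of_squared_differences(D, A))  # 1 + 0 + 9
print(max_difference(D, A))              # user 3 falls short by 3
print(sum_of_percent_differences(D, A))  # 25% + 0% + 50%
print(max_percent_difference(D, A))      # user 3 at 50%
```

Different criteria can rank the same set of allocations differently, which is why the choice among them is part of the modeling exercise.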


4.3 An Example Allocation Problem 


Assume for this example that the resources being allocated are apples. The
available apples are allocated to three community farmers' markets that modify (clean
and package) the apples they get and then sell these apples to various customers.
The maximum unit price they can charge their customers and still sell all they have
is dependent on the number of apples they have available for sale. For farmers'
market 1 this unit price function (also called a demand function) is $(6 - A_1)$. The
total income derived from an allocation of $A_1$ apples is, therefore, the unit price
$(6 - A_1)$ times the quantity $A_1$. This product equals $6A_1 - A_1^2$ and defines the
function $B_1(A_1)$. Assume $B_2(A_2)$ is $7A_2 - 1.5A_2^2$ and $B_3(A_3)$ is
$8A_3 - 0.5A_3^2$. These are concave functions that look like hills whose slopes decrease as the
allocations $A_i$ increase. Their maximum income values result when the allocations
are 3, 7/3, and 8, respectively, for a total of 13.33. While not necessarily realistic,
these functions will serve to illustrate various model solution methods.

If the total apples available, R, equals or exceeds the sum of the allocations, 
13.33, that result in the maximum incomes, then there is no need to model the 
problem. Just make those allocations to obtain the maximum possible total income. 
However, if the available apples, R, is less than 13.33, solving a model can help 
define the allocations to each market that will maximize the total income that can 
be obtained from those R apples. 
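The allocations 3, 7/3, and 8 quoted above are the tops of the three 'hills'. As a quick check, the sketch below uses the fact that a function of the form p*A - q*A**2 peaks at A = p/(2q); the names `COEFFS` and `peak_allocation` are illustrative, not from the text.

```python
# Coefficients (p, q) of the example benefit functions B(A) = p*A - q*A**2.
COEFFS = [(6.0, 1.0), (7.0, 1.5), (8.0, 0.5)]

def peak_allocation(p, q):
    """Allocation at the top of the hill, where the slope p - 2*q*A is zero."""
    return p / (2.0 * q)

peaks = [peak_allocation(p, q) for p, q in COEFFS]
total_wanted = sum(peaks)

print(peaks)         # 3, 7/3 and 8
print(total_wanted)  # about 13.33
```

If R is at least `total_wanted`, every market can simply be given its peak allocation.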
The optimization model for finding this maximum total income can be written
as follows:

Maximize TB = total income or total benefit.

Subject to the following:

$TB = B_1 + B_2 + B_3$   Defines total benefit as sum of individual benefits.
$B_1 = 6A_1 - A_1^2$
$B_2 = 7A_2 - 1.5A_2^2$
$B_3 = 8A_3 - 0.5A_3^2$
$\sum_{i=1}^{3} A_i \le R$   Total allocation cannot exceed the resources available.
$A_i \ge 0$   Non-negative allocations for i = 1, 2, 3.


4.4 Hill Climbing 


One approach to solving this model is to divide the resources available, R, into
discrete units and then allocate each successive unit of resource to
whichever market will gain the largest additional benefit from it. This works
for this example because each benefit function is smooth and concave everywhere,
i.e., the slope of each function decreases as the allocation increases. This method
for finding the best allocations is called the steepest hill approach. It works for
finding a maximum value of an objective function when the functions are concave
or for minimizing when the functions are convex. The smaller the discrete units
of allocation, the more accurate the solution will be.

The sketches in Fig. 4.3 illustrate this steepest hill climbing approach for
solving the above model. Each plot shows the benefits (on the vertical axes)
associated with integer allocations (shown on the horizontal axes). The numbers shown
between the red dashed horizontal lines are the additional benefits obtained from
an additional allocation unit. Each number is the slope of the ‘hill’ in that interval of the
function. Hill climbing involves finding the steepest hill among all those remaining,
and climbing it, i.e., allocating another unit of resources to that user. This
process continues, allocating one unit of resource at a time, until there are no more
resources available to allocate or, in this example, until any additional allocation
results in a decrease of benefits.

[Figure 4.3 panels: $B_1 = 6A_1 - A_1^2$, $B_2 = 7A_2 - 1.5A_2^2$, and $B_3 = 8A_3 - 0.5A_3^2$, each plotted against integer allocations with the incremental benefits $\Delta B_i$ marked.]

Fig. 4.3 User benefit functions $B_i$ associated with integer units of allocations $A_i$. Also shown
between the red dashed lines are the slopes of the benefit function segments, $\Delta B_i / \Delta A_i$, where all
$\Delta A_i$ equal 1.

Referring to Fig. 4.3, each user would like to have the resources that maximize 
the value of their benefits, i.e., their income. User 1 would like 3 discrete units of 
resource, user 2 would like 2 discrete units, and user 3 would like 8 discrete units, 
adding up to 13 discrete units. 

Assume only 6 units of resource, R, are available. Clearly, all 6 units will be
allocated since increasing allocations result in increasing total benefits up to 13 units.
One way to determine how 6 units of resource could be allocated that maximizes 
the total benefits obtained from them is to divide the 6 units of resource into 
discrete units (e.g., integer values) and allocate each of them in succession to the 
user that gains the most additional benefits. Once an allocation is made, there is no 
need to change it later. Once again this is because each user’s benefit function is a 
continuous concave function. Additional benefits decrease as allocations increase. 
Thus, during the allocation process, one attempts to keep the slopes the same at 
each allocation. These slopes are called marginal benefits. 

Referring again to Fig. 4.3, if only one discrete integer unit of resource is 
available, it should be allocated to Use 3. This is because 7.5 additional benefits 
obtained from Use 3 are greater than 5.5 obtained from Use 2 or 5 from Use 1. 
This results in allocations to the three uses of 0, 0, and 1, respectively. 

The next unit of resource also goes to Use 3 since 6.5 is greater than 5.5 from 
Use 2 or 5 from Use 1. The allocations to the three uses are now 0, 0, and 2, 
respectively. 

The third unit of resource can go to either Use 2 or Use 3 since 5.5 is obtained 
from both and is greater than 5 from Use 1. Say it goes to Use 2. The allocations 
to the three uses are now 0, 1, and 2, respectively. 

The fourth unit of resource goes to Use 3 since 5.5 obtained from Use 3 is
greater than 5 from Use 1 and 2.5 from Use 2. The allocations to the three uses
are now 0, 1, and 3, respectively.

The fifth discrete unit goes to Use 1 since 5 from Use 1 is greater than 4.5 from 
Use 3 and 2.5 from Use 2. The allocations to the three uses are now 1, 1, and 3, 
respectively. 

The sixth unit goes to Use 3 since 4.5 from Use 3 is greater than 3 from Use
1 and 2.5 from Use 2. Hence, the final allocations are $A_1 = 1$, $A_2 = 1$, and $A_3 = 4$.
Plugging these values into the total benefit function yields 34.5.
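The unit-by-unit procedure just described can be sketched in a few lines of Python. This is one possible way to code it, not the only one; the function names are illustrative, and ties are broken in favor of the lower-numbered use, which matches the choice of Use 2 for the third unit.

```python
def benefit(i, a):
    """Income B_i(a) for use i (0, 1 or 2) from the apple example."""
    coeffs = [(6.0, 1.0), (7.0, 1.5), (8.0, 0.5)]
    p, q = coeffs[i]
    return p * a - q * a ** 2

def hill_climb(resource_units, n_users=3, step=1.0):
    """Give each successive unit to the use with the largest additional benefit."""
    alloc = [0.0] * n_users
    for _ in range(resource_units):
        gains = [benefit(i, alloc[i] + step) - benefit(i, alloc[i])
                 for i in range(n_users)]
        best = max(range(n_users), key=lambda i: gains[i])
        if gains[best] <= 0:
            break  # any further allocation would not increase benefits
        alloc[best] += step
    total = sum(benefit(i, alloc[i]) for i in range(n_users))
    return alloc, total

allocations, total_benefit = hill_climb(6)
print(allocations, total_benefit)  # [1.0, 1.0, 4.0] 34.5
```

Calling `hill_climb(20)` stops after 13 units at allocations of 3, 2, and 8, because every further unit would reduce some user's benefit.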

Note that the slopes, $[B_i(A_i + 1) - B_i(A_i - 1)] / [(A_i + 1) - (A_i - 1)]$, of each
of these benefit functions at their optimal allocations all equal 4. For Use 1 at $A_1 = 1$,
the slope between $A_1 = 0$ and $A_1 = 2$ is (8 - 0)/2 = 4. For Use 2 at $A_2 = 1$,
the slope between $A_2 = 0$ and $A_2 = 2$ is (8 - 0)/2 = 4. For Use 3 at $A_3 = 4$, the
slope between $A_3 = 3$ and $A_3 = 5$ is (27.5 - 19.5)/2 = 4. In general, when discrete allocations
are being made, as they were in this example, it is likely that the marginal benefits
(the slopes of the benefit functions at the total allocation to each user)
will not all be the same, as they will be if the allocations are not discrete.

For those who know calculus, you can verify that the exact slopes at each
optimal allocation are indeed 4 for all users. For this problem, it turns out that the
optimal continuous solution for 6 available resources is the same as the integer solution:
$A_1 = 1$, $A_2 = 1$, and $A_3 = 4$. (For those not yet acquainted with calculus, it will
be introduced and used to solve this allocation problem in Chap. 10.)
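For readers who already know how to differentiate, the check is short. The derivatives below are computed from the three benefit functions of the example and evaluated at the allocations 1, 1, and 4:

```latex
\begin{aligned}
\frac{dB_1}{dA_1} &= 6 - 2A_1 = 6 - 2(1) = 4, \\
\frac{dB_2}{dA_2} &= 7 - 3A_2 = 7 - 3(1) = 4, \\
\frac{dB_3}{dA_3} &= 8 - A_3 = 8 - 4 = 4.
\end{aligned}
```

All three slopes are equal and the allocations sum to 6, so no shift of resource from one user to another can raise the total benefit.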


4.5 Shadow Price 


Before leaving this example allocation problem, it is of interest, especially to
economists dealing with the allocation of scarce resources, to see what additional
benefits (or some other measure of performance serving as the objective) can be
obtained from an increase in the amount of the scarce resource (denoted as R in
the previous example). These additional benefits can be compared to the cost of
getting more resources to see if that will yield more net benefits. The change in the
value of the objective function that is either being maximized (e.g., total benefits)
or minimized (e.g., total costs or losses), per unit increase in the available resource,
is often called the shadow price or the
dual variable associated with the resource constraint; in this case, $\sum_i A_i \le R$. In
this example, its value is the slope of each of the benefit functions at their optimal
allocations. For this example allocation problem, that slope is 4. What this means
is that if R were increased by 0.1 to 6.1 instead of being 6, the additional benefits
obtained would be about 0.4. Since in this non-linear problem the slopes of the
benefit functions decrease as R increases, this shadow price or dual variable value
is valid only for small changes in R. Obviously when R is 13.33 or more, the shadow price
will equal 0. Having more resources will not yield greater benefits. In this case,
the constraint on R ($\sum_i A_i \le R$) is not binding, meaning that it does not impact
the optimal solution.
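The shadow price in this example can be checked numerically by solving the continuous allocation problem for R = 6 and R = 6.1 and comparing total benefits. The sketch below is an assumption-laden illustration rather than the book's method: it searches by bisection for the common slope at which the allocations just use up R, and all function names are hypothetical.

```python
# Coefficients (p, q) of the example benefit functions B(A) = p*A - q*A**2.
COEFFS = [(6.0, 1.0), (7.0, 1.5), (8.0, 0.5)]

def allocations_at_slope(slope):
    """Allocation per user at which its marginal benefit p - 2*q*A equals slope."""
    return [max(0.0, (p - slope) / (2.0 * q)) for p, q in COEFFS]

def solve_allocation(resource):
    """Return (total benefit, allocations, shadow price) for a divisible resource."""
    low, high = 0.0, max(p for p, _ in COEFFS)
    if sum(allocations_at_slope(0.0)) <= resource:
        slope = 0.0  # constraint not binding: everyone gets their peak allocation
    else:
        for _ in range(100):  # bisection on the common slope
            slope = 0.5 * (low + high)
            if sum(allocations_at_slope(slope)) > resource:
                low = slope   # slope too small: allocations exceed the resource
            else:
                high = slope
        slope = high
    alloc = allocations_at_slope(slope)
    total = sum(p * a - q * a ** 2 for (p, q), a in zip(COEFFS, alloc))
    return total, alloc, slope

total_6, alloc_6, price_6 = solve_allocation(6.0)
total_61, _, _ = solve_allocation(6.1)
print(price_6)             # close to 4
print(total_61 - total_6)  # close to 0.4
```

The gain from the extra 0.1 unit is slightly less than 0.4 because the common slope falls a little as R increases.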

In general, for any optimization problem containing an objective f(X) and
constraints $g_i(X) \le$, $\ge$, or $= b_i$, the shadow price of constraint i is the change in the
objective function $\Delta f(X)$ given a unit change in $b_i$.


Exercises 


1. As the supervisor of your town, you are responsible for allocating money to 
different public agencies serving the town. The allocations have been based on 
political, not economic, criteria. Each agency is expecting to get at least what they 
got last year, but because of the loss of tax revenue due to the pandemic, you do 
not have as much money to distribute as you did before. 

(a) State what you think would be a fair way to allocate the limited funds you 
have. In other words, what would be your criterion for allocating funds that 
you could defend at a public hearing? 
(b) Develop a model that when solved would identify the allocations that meet
your objective. Clearly define the variables and parameters you use, and the 
objective function and constraints. 

2. Blueberries 

There are three farmers' markets that sell organically and locally grown
blueberries. The farmer who grows these blueberries gets 90% of the income from
their sales; the markets get the other 10%. The demand for blueberries differs
at each market. Some smart economist has determined that the demand (unit
price) functions for blueberries at the three markets (m = 1, 2, 3) are $6/(1+Q_1)$,
$7/(1+1.5Q_2)$, and $8/(1+0.5Q_3)$, respectively, where the $Q_m$ values are the available
blueberries at those markets.


[Figure: Demand functions for blueberries, showing unit price as a function of $Q_m$.]


How should the farmer distribute a crop ranging from 1 to 6 bushels of blue- 
berries each week to maximize the total amount of income received from all three 
markets? 

(a) Construct an optimization model and solve it using the hill climbing method, 
assuming integer bushel allocations. Identify the best distribution of 1 to 6 
bushels. 

(b) Based on the results of this hill climbing method, sketch a maximum revenue 
function for the farmer based on the total amount of blueberries available to 
send to the three markets. 

(c) How would the integer allocation of 6 bushels differ if the overall objective 
were to maximize the total income from all three markets while keeping their 
individual market incomes as close to being the same as possible? 

3. Suppose you wish to minimize flood risks in two towns. Flood risk is measured in
expected property damage. You have $2 million to spend on flood risk reduction.
Construct an optimization model and solve it using the hill climbing method to
determine how to spend the $2 million so as to maximize the total risk reduction.

Investment (millions of $) | Total reduced risk, Town A | Total reduced risk, Town B
1 | 12 | 18
2 | 22 | 27
Models for Managing Money


ABSTRACT 


The chapter reviews ways of computing equivalent present, future, or equal 
annual values that can be used to compare different time series of costs and ben- 
efits. It defines both simple and compound interest modeling, and the impacts 
of within-year compounding, inflation, and income taxes. 


5.1 Introduction 


This chapter serves two purposes. One is to introduce some methods used to con- 
vert costs and benefits at different time periods to equivalent values at other time 
periods, and the other is to show how to evaluate options for managing our own 
financial resources. All this involves modeling. 


5.2 The Time Value of Money 


Figure 5.1 illustrates the time value of money. For example, assume you have won 
a cash prize of $10,000. You can either receive it now, option A, or receive it 
in three years, option B. The offer is hypothetical, but play along. Which option 
would you choose, and why (Fig. 5.2)? 

If you’re like most people who prefer having more rather than less money, you 
would choose to receive the $10,000 now, option A. After all, three years is a long 
time to wait. Why would any rational person prefer being paid later when he or 
she could have the same amount of money now? For most of us, preferring to have
money now rather than later is just plain instinctive. And why?

Having $10,000 now allows you to spend it now. If you do not need it, you
can loan it to someone who does need it now, and for that loan, the receiver can
promise to give it back to you later, plus some additional money, called interest.
Indeed, that is what banks do with the money you ‘loan’ them to save for you.

Fig. 5.1 The value of money can grow over time and the more time the more money. The initial
investments shown are assumed to continue each year up to the age of 65 compounding at an annual
rate of 5%

Option | Present value (year 0) | Future value (year 3)
A | $10,000 | $10,000 + interest
B | $10,000 - interest | $10,000

Fig. 5.2 Schematic of the two options for receiving $10,000. The amounts shown in blue are the
equivalent values three years later for option A, or three years earlier for option B

Having money now rather than later is worth paying for by those who need that
money now. Those who borrow money, say from a bank, usually have to pay it
back later with interest. What they pay back is more than what they borrow. This is
true even for banks in countries where earning interest by individuals is considered
unethical. Otherwise, how could those banks survive?

By receiving $10,000 today, you can increase the future value of your money 
by investing it and gaining interest over time. If you invested it in a savings bank 
for three years, you would have the $10,000 plus the interest earned that the bank 
pays you for the use of your money over that time. If you wait until the end of 
three years to get the $10,000 cash prize, all you will have is the $10,000. The 
interest the bank pays you is based on the amount you give to them to save for you 
and the time they have used it. The interest rate is usually expressed as a percent 
of that amount per unit time period, typically a year. The interest rate is commonly 
denoted by the fraction, or percent, i. 
Example: Assume that the interest is paid at the end of each year based on
the amount invested in the savings account at the beginning of the year. If the 
annual interest rate is 4.5%, then at the end of the first year your $10,000 becomes 
$10,000 (1 + 0.045) = $10,450. The interest earned that year is $450. 

If the $10,450 in your investment account at the end of the first year remains 
for another year, at the end of that second year you would have that plus another 
year of interest: $10,450(1 + 0.045) = $10,920.25. 

This value at the end of the second year is

$10,000 × (1 + 0.045) × (1 + 0.045) = $10,000 (1 + 0.045)^2.

Investing the original $10,000 for three years would give you

$10,000 (1 + 0.045)^3 = $11,411.66.


The annual interest rate of 4.5% is a compound interest rate, as interest is 
reinvested and earns interest along with the initial investment, the principal. If the 
interest earned is removed from the savings account each year, the 4.5% interest 
rate is called a simple interest rate. The total amount one would accumulate after
n years of investing at 4.5% simple interest rate per year would be $10,000 (1 +
n(0.045)), i.e., the $10,000 principal plus n years of $450 interest payments.
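The difference between compound and simple interest in this example can be tabulated with two one-line functions. This is a minimal sketch; the function names are illustrative.

```python
def compound_value(principal, rate, years):
    """Value when interest is reinvested each year: P (1 + i)^n."""
    return principal * (1.0 + rate) ** years

def simple_value(principal, rate, years):
    """Value when interest is withdrawn each year: P (1 + n i)."""
    return principal * (1.0 + rate * years)

for n in (1, 2, 3):
    print(n,
          round(compound_value(10_000, 0.045, n), 2),
          round(simple_value(10_000, 0.045, n), 2))
# Year 3 shows 11411.66 with compounding versus 11350.0 with simple interest.
```

The two values agree after one year and then drift apart, because only the compound account earns interest on earlier interest.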


5.3 Computing Present Values of Future Cash Flows 


If you received $10,000 today, the present value would of course be $10,000. If
$10,000 were to be received in a year, its equivalent present value now,
at the beginning of the year, would not be $10,000 but rather the amount that, if
invested today, would total $10,000 in a year. And that depends on the interest rate
you can earn on that investment.

Letting $P_0$ be the present value (at the end of year 0) and $F_n$ be the future value
at the end of year n, the basic equation for finding either $P_0$, $F_n$, or the assumed
constant compound interest rate i, is (Fig. 5.3)

$F_n = P_0 (1 + i)^n$.

Fig. 5.3 Cash flow diagram showing present and equivalent future values

Fig. 5.4 Distinguishing between compounding a present value into the future
and discounting a future value to the present


Finding a future amount at the end of period n, $F_n$, given a present amount,
$P_0$, and period interest rate i, is called compounding into the future. The
opposite is called discounting a future value to the present. The distinction between
compounding and discounting is shown in Fig. 5.4.

One can use the above single payment compound amount equation to find that 
$8,762.97 invested today at an annual compound interest rate of 4.5% for three 
years will equal $10,000 at the end of that third year, assuming again that interest 
is paid and reinvested at the end of each year. $8,762.97 is the present value 
of $10,000 at the end of three years. $10,000 is the future value of $8,762.97 
invested today. Both statements assume an annual compound interest rate of 4.5% 
with interest paid at the end of each year. 

What if in option B the cash prize payment in three years is more than $10,000, 
the amount you would receive today in option A? Say you could receive either 
$10,000 today (option A) or $13,000 at the end of three years. Which would you 
choose? The decision is now less obvious. If you choose to receive $10,000 today 
and invest the entire amount, you may actually end up with an amount of cash at 
the end of three years that is less than $13,000. To decide which option is better 
you could compute either the future value of $10,000 three years from now and 
compare it to the $13,000, or compute the present value of $13,000, and compare 
it to the $10,000. 

For example, if interest rates are currently 4%, using the above equation, the
equivalent present value of $13,000 three years from now is $11,556.95. Thus, the
choice is between $10,000 and $11,556.95. Most would choose to postpone prize
payment for three years. If you really needed $10,000 today and could borrow it
at an annual interest rate less than 9%, you would be able to pay off the debt in
three years with the $13,000 prize and still have some left over.
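The present values quoted in this section, and the borrowing rate at which the two prize options break even, can be reproduced as follows. This is a sketch with illustrative function names; the break-even rate is computed here and is not a figure given in the text.

```python
def present_value(future_amount, rate, years):
    """Discount a future amount to the present: P0 = Fn / (1 + i)^n."""
    return future_amount / (1.0 + rate) ** years

def break_even_rate(present_amount, future_amount, years):
    """Rate i at which present_amount grows to future_amount in the given years."""
    return (future_amount / present_amount) ** (1.0 / years) - 1.0

print(round(present_value(10_000, 0.045, 3), 2))  # 8762.97
print(round(present_value(13_000, 0.04, 3), 2))   # 11556.95
print(break_even_rate(10_000, 13_000, 3))         # a little over 0.09
```

At any borrowing rate below the break-even rate, $10,000 borrowed today costs less than $13,000 to repay in three years.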


5.4 Computing Equivalent Constant End-of-Period Amounts 


Many benefit-cost calculations use annual costs and benefits. For example, if you
want to borrow $200,000 to buy your first house, you typically go to a bank and
get a loan. The bank tells you how much money you need to pay the bank, in equal
payments, A, at the end of each year for a given number of years, to pay back the
loan plus interest. To calculate this constant annual amount, A, paid at the end of
each year, we find the sum of the present values of each of those annual payments
of A and equate that sum to the original present value of debt of $200,000. If n is
the number of years of payments

$P_0 = 200{,}000 = A/(1+i) + A/(1+i)^2 + A/(1+i)^3 + \cdots + A/(1+i)^n$.

This is equivalent to

$P_0 = A[(1+i)^n - 1]/[i(1+i)^n]$ or $A = P_0\{ i(1+i)^n / [(1+i)^n - 1] \}$.

This is how the banks determine what you owe to pay back a loan with equal
end-of-period payments over n time periods assuming an interest rate of i per
period. The period most banks use is a month, not a year. If i represents an annual
interest rate, the monthly rate is i/12 (Fig. 5.5).

Fig. 5.5 Cash flow diagram for a constant end-of-period ... present value of $P_0$
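The payment formula can be tried on the $200,000 loan. The text does not give an interest rate or term, so the 5% annual rate and 30-year term below are assumptions chosen only for illustration, as are the function names.

```python
def level_payment(principal, period_rate, n_periods):
    """Equal end-of-period payment A = P0 * i(1+i)^n / ((1+i)^n - 1), for i > 0."""
    growth = (1.0 + period_rate) ** n_periods
    return principal * period_rate * growth / (growth - 1.0)

def present_value_of_payments(payment, period_rate, n_periods):
    """Sum of the present values of n equal end-of-period payments."""
    return sum(payment / (1.0 + period_rate) ** k for k in range(1, n_periods + 1))

annual_rate, years = 0.05, 30  # assumed loan terms, not from the text

annual_payment = level_payment(200_000, annual_rate, years)
monthly_payment = level_payment(200_000, annual_rate / 12, years * 12)

print(round(annual_payment, 2))
print(round(monthly_payment, 2))
# Discounting the monthly payments recovers the amount borrowed.
print(round(present_value_of_payments(monthly_payment, annual_rate / 12, years * 12), 2))
```

The last line is a consistency check: the sum of the discounted payments should equal the original debt, which is exactly how the formula was derived.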

When one gives money to an organization’s endowment, one usually expects
it will provide income to that organization forever. The end-of-year annual equal
payment A from an endowment of $P_0$ that can be paid forever can be calculated
using the above equation when n goes to infinity. The result is the same as if
simple interest were being used: the equal annual payment is $A = P_0 i$.
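The limit can be seen by dividing the numerator and denominator of the payment formula by $(1+i)^n$. For any interest rate i greater than 0, the term $(1+i)^{-n}$ shrinks toward zero as n grows:

```latex
A = P_0 \, \frac{i(1+i)^n}{(1+i)^n - 1}
  = P_0 \, \frac{i}{1 - (1+i)^{-n}}
  \;\longrightarrow\; P_0 \, i
  \quad \text{as } n \to \infty .
```

In other words, a perpetual payment can only spend each year's interest and must leave the principal untouched.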


5.5 Within-Year Compounding 


If you are saving money in a bank savings account, the interest you earn each day 
is the minimum amount you have in your account that day times the daily interest 
rate. This daily rate is the annual ‘nominal’ rate (say 5%) divided by 365. This 
daily rate can be applied in any of the above equations, where instead of the time 
period being a year, it is a day. 

Hence, $F_1$ at the end of a day = $P_0$ (1 + annual nominal interest rate/365).

This is daily compounding. Interest is earned and paid to the account each day.

$F_{365}$ at the end of a year of daily compounding = $P_0$ at the beginning of the year
times the factor (1 + annual nominal rate/365)$^{365}$.

The future value after n years of daily compounding at a nominal annual rate
of 5% is

$F_n = P_0 (1 + 0.05/365)^{365n}$.
If r is the nominal annual interest rate, but compounding occurs in each of m
equal periods within a year, then the corresponding effective annual rate i that
assumes compounding occurs only once in a year is

$(1 + i)^1 = (1 + r/m)^m$ or $i = (1 + r/m)^m - 1$.

The annual effective rate i associated with within-year period compounding (m > 1) is
clearly greater than the annual nominal rate r. For example, monthly compounding
at a nominal annual interest rate of r is equivalent to annual compounding at an
effective interest rate of $(1 + r/12)^{12} - 1$.

Daily compounding, which many bank savings accounts offer, is almost
equivalent to what is called continuous compounding, i.e., compounding every nanosecond!
If the nominal annual rate of interest is r, the corresponding effective continuous
compounding annual rate turns out to be $e^r - 1$, where e is the base of natural
logarithms, e = 2.718281828. The factor (1 + i) becomes $(1 + e^r - 1)$ or $e^r$. Thus,
for continuous compounding over n years, an investment of $P_0$ at the beginning of
year 1 (or end of year 0) will yield

$F_n = P_0 (e^r)^n$

at the end of n years.
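A short calculation shows how little is gained by compounding more and more often. The sketch below uses the 5% nominal rate mentioned earlier in this section; the function names are illustrative.

```python
import math

def effective_annual_rate(nominal_rate, periods_per_year):
    """Effective annual rate i = (1 + r/m)^m - 1 for m compounding periods per year."""
    return (1.0 + nominal_rate / periods_per_year) ** periods_per_year - 1.0

def continuous_effective_rate(nominal_rate):
    """Effective annual rate under continuous compounding, e^r - 1."""
    return math.exp(nominal_rate) - 1.0

r = 0.05
for label, m in (("annual", 1), ("monthly", 12), ("daily", 365)):
    print(label, round(effective_annual_rate(r, m), 6))
print("continuous", round(continuous_effective_rate(r), 6))
```

The effective rate rises with the number of compounding periods but levels off quickly, so daily and continuous compounding are nearly indistinguishable.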


5.6 Inflation 


Prices of goods and services usually increase over time. This is called inflation. 
The actual rate of inflation varies depending on the item. The increase (or decrease) 
in home prices is not the same as, for example, the increase in university tuition. 
General consumer price index (CPI) inflation rates mentioned in the media are 
commonly based on the prices of a set of goods and services that are included in 
the CPI. The rate of inflation varies over time, of course, just like interest rates. The 
inflation rate is commonly designated by the letter f. Hence, assuming an annual
inflation rate of f, something that costs $100 today will cost $100(1 + f)^n at the
end of n years from now. If for no other reason, one should invest money to keep
up with inflation. Otherwise, even if you have the same amount of money now
and n years from now, you will be poorer then in the sense you will not be able to
buy as much then as you can now with that amount of money. Obviously, one tries
to build wealth at a rate greater than the rate of inflation. Taking into account the
effects of inflation, the ‘real’ uninflated rate of return, r, on any investment earning
an interest rate of i is (Fig. 5.6)

$(1 + r) = (1 + i)/(1 + f)$ or $r = (1 + i)/(1 + f) - 1$.

The real rate of return, r, is often called the true or real time value of money. (Do
not confuse this r with the r denoting the nominal annual interest rate applicable
to within-year compounding.)
Fig. 5.6 Impact of 4% inflation on the purchasing power of today’s $1 over the next 25 years


To compute the inflation adjusted annual payments so that each payment has the
same purchasing power, the real uninflated interest rate r can be used to compute
the constant payment A, and then each A is inflated at the time of payment. Hence,
instead of using

$A = P_0\{ i(1+i)^n / ((1+i)^n - 1) \}$,

use the real rate of return r in that equation in place of i to compute A and then
inflate it at the time of payment.

$A_n$ = the actual payment at end of year n = $A(1 + f)^n$.
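The two-step procedure (compute A with the real rate, then inflate each payment) can be sketched as follows. The loan amount, interest rate, and inflation rate are hypothetical values chosen for illustration, and the function names are not from the text.

```python
def real_rate(interest_rate, inflation_rate):
    """Real rate of return r = (1 + i)/(1 + f) - 1."""
    return (1.0 + interest_rate) / (1.0 + inflation_rate) - 1.0

def level_payment(principal, rate, n_periods):
    """Equal end-of-period payment for a non-zero rate."""
    growth = (1.0 + rate) ** n_periods
    return principal * rate * growth / (growth - 1.0)

def inflation_adjusted_payments(principal, interest_rate, inflation_rate, n_years):
    """Payments A(1 + f)^k for k = 1..n, with A computed from the real rate r."""
    r = real_rate(interest_rate, inflation_rate)
    base = level_payment(principal, r, n_years)
    return [base * (1.0 + inflation_rate) ** year for year in range(1, n_years + 1)]

# Hypothetical example: 1,000 repaid over 5 years, i = 6%, f = 2%.
payments = inflation_adjusted_payments(1_000.0, 0.06, 0.02, 5)
print([round(p, 2) for p in payments])

# Discounting the actual payments at the interest rate i recovers the principal.
check = sum(p / 1.06 ** k for k, p in enumerate(payments, start=1))
print(round(check, 2))
```

Each payment is 2% larger than the one before it, so all five have the same purchasing power, and together they still repay the loan at the 6% interest rate.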


5.7 Income Taxes 


In addition to wanting the interest rate you are getting on your investments to be
greater than the rate of inflation, you also want it to remain greater than the inflation
rate after you have paid your income taxes on the interest earned. The net interest rate
after taxes depends on the tax rate. Letting t be the tax rate, then the net interest rate
after taxes is i(1 - t). This expression assumes you pay the taxes when the interest
is earned. Even though this is rarely the case, it is a good enough assumption for
most economic calculations we will be performing (Fig. 5.7).

Thus, the future value, $F_n$, after taxes, on an investment of $P_0$ for n years at an
annual before tax interest rate i will be

$F_n = P_0 (1 + i(1 - t))^n$.
Fig. 5.7 There are only two
things that are certain in life: 
death and taxes. November 
13th, 1789, Benjamin 
Franklin. http://www.clker. 
com/cliparts/4/9/£/1/15 1676 
05761546791 1 5death-and- 
taxes-clipart.hi.png, http:// 
www.clker.com/clipart-744 
320.html. Public domain 


If the investment is placed in a tax-deferred account, the income tax is paid
only when the money is withdrawn, say at the end of n years. In this case, the
after-tax amount will be

$F_n = P_0(1+i)^n - [P_0(1+i)^n - P_0]\,t$ or $P_0[(1+i)^n(1 - t) + t]$.


Obviously, if you can do this, tax-deferred investments offer more at the end of 
such investment periods than do accounts where taxes have to be paid each year. 
But this may depend on the tax rates that can differ over time as well. 
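The two after-tax formulas can be compared side by side. The investment amount, interest rate, tax rate, and horizon below are hypothetical, and a single constant tax rate is assumed throughout.

```python
def after_tax_taxed_annually(principal, rate, tax_rate, years):
    """Future value when tax is paid each year on the interest: P0 (1 + i(1 - t))^n."""
    return principal * (1.0 + rate * (1.0 - tax_rate)) ** years

def after_tax_deferred(principal, rate, tax_rate, years):
    """Future value when tax on all accumulated interest is paid at withdrawal."""
    gross = principal * (1.0 + rate) ** years
    return gross - (gross - principal) * tax_rate

# Hypothetical example: 10,000 invested for 20 years at 5% with a 25% tax rate.
P0, i, t, n = 10_000.0, 0.05, 0.25, 20

print(round(after_tax_taxed_annually(P0, i, t, n), 2))
print(round(after_tax_deferred(P0, i, t, n), 2))
```

With these assumed numbers the tax-deferred account ends up larger, because interest that would have gone to taxes each year keeps earning interest until withdrawal. For a single year the two formulas give the same result.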

In any event, unless the after-tax rate of interest one earns exceeds the rate of
inflation, the monetary gains recorded in bank statements over time will be losing
purchasing power.


5.8 Comparing Alternatives 


It is important to know how to calculate the value of money over time so that 
you can distinguish between the worth of alternative investments that offer dif- 
ferent returns, or costs and benefits, at different times over different time periods. 
Remember that you cannot move money around over time without using the appli- 
cable interest rate unless, of course, it is 0. $100 today is not the same as $100 
tomorrow. To compare different alternatives having different time streams of costs 
and benefits, we must move money around over time to compute equivalent present 
values, P_0, future values, F_n, or equal annual end-of-year values, A. When doing
this comparison of alternatives, one must be considering what to do with the same 
amount of money invested (costs) over the same amount of time for all alternatives 
being compared. 

For example, consider the following. There are two alternatives, A and B, 
that involve different initial investments. These initial investments along with the 
present values of the future net benefits are given in the table below. Both the net 
present values and the present benefit/cost ratios are also shown. You will see that 
based on an objective of maximizing net benefits alternative A is best. But based 
on the objective of maximizing the benefit/cost ratio alternative B is best (Table 
5.1). 
Table 5.1 Costs and benefits of two alternatives

Alternative                   A      B
Present value of costs        40     10
Present value of benefits     50     15
Net benefits                  10     5
Benefit/cost ratios           5/4    3/2


Both the net benefit and the benefit/cost criteria should indicate the same best 
alternative. What is missing in this analysis? 

In this example, the issue is how best to invest the $40 that is apparently
available, since alternative A is being considered. If alternative B is chosen,
$10 is invested and $30 is left over. The present value of the benefits obtained
from the $40 is then this $30 plus $15, or $45. Thus, the benefit/cost ratio for
alternative B is really 45/40 = 9/8. This is less than the benefit/cost ratio of
5/4 for alternative A, and hence based on both the net benefit and benefit/cost
criteria, alternative A is best.
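
The same-budget comparison can be written out as a short calculation. This is only a sketch under the assumption made in the example, namely that money left uninvested keeps its present value; the function name and the dictionary layout are hypothetical.

```python
def compare_on_common_budget(alternatives, budget):
    """Net benefit and benefit/cost ratio of each alternative for the same budget.

    `alternatives` maps a name to (present value of costs, present value of benefits).
    Any part of the budget not invested is assumed to keep its present value.
    """
    results = {}
    for name, (cost, benefit) in alternatives.items():
        leftover = budget - cost
        total_benefit = benefit + leftover
        results[name] = {
            "net_benefit": total_benefit - budget,
            "bc_ratio": total_benefit / budget,
        }
    return results


# Values from Table 5.1, compared on the $40 needed for alternative A.
results = compare_on_common_budget({"A": (40, 50), "B": (10, 15)}, budget=40)
for name, r in results.items():
    print(name, r["net_benefit"], r["bc_ratio"])
```

On this common basis both criteria rank alternative A first, which is the point of the example.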


5.9 Investing for Retirement 


Assume you can invest $5500/year of earned income into a tax-free account. In 
the US, it might be a Roth Individual Retirement Account (IRA). Also, assume 
that you can start investing at age 25 and you plan to retire 40 years later at age 
65. Finally, assume that you can earn an average annual rate of interest of 8% 
over the 40-year period. Investing $5500 at the beginning of a year will result in 
5500(1 + 0.08) = $5940 at the end of the year. Interest earned is $5500(0.08) = 
$440.00 and is tax free when it is withdrawn after you retire. At the beginning of 
the second year, you invest another $5500 in the account. At the end of two years 
of investing, you have 


($5940 + $5500)(1 + 0.08) = $12,355.20.

At the end of three years of investing $5500 at the beginning of each year, you have

($12,355.20 + $5500)(1 + 0.08) = $19,283.62.


Notice that the model one can use to compute how much you will have, F_n, at the
end of n years of investing P at the beginning of each year, at 8% per year, is


F_1 = P(1 + 0.08),
F_2 = (F_1 + P)(1 + 0.08),
F_3 = (F_2 + P)(1 + 0.08), ...

and so on for each year y until y = n. This model can be written as

F_y = (F_{y-1} + P)(1 + 0.08) for y = 1, 2, 3, ..., n and F_0 = 0.


In this example, all the beginning-of-year investments, P, are $5500. There are 
more elegant ways of computing any Fn, but the above sequence of equations, 
solved sequentially for each time period y, works. At the end of 10 years of invest- 
ing $5500 at 8% per year, you will have $86,050.18. After 30 years of investing, 
you will have $672,902.30. 
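
The year-by-year model translates directly into a loop. The sketch below is illustrative only; the function names are assumptions, and the closed-form expression is the standard sum of a geometric series, shown here as one of the "more elegant ways" mentioned above rather than as the book's own formula.

```python
def future_value_recursive(p, rate, n):
    """Apply F_y = (F_{y-1} + P)(1 + rate) for y = 1..n, starting from F_0 = 0."""
    f = 0.0
    for _ in range(n):
        f = (f + p) * (1 + rate)
    return f


def future_value_closed_form(p, rate, n):
    """Same quantity as a geometric series: P (1 + rate) [(1 + rate)^n - 1] / rate."""
    return p * (1 + rate) * ((1 + rate) ** n - 1) / rate


for years in (2, 3, 10, 30):
    print(years,
          round(future_value_recursive(5500, 0.08, years), 2),
          round(future_value_closed_form(5500, 0.08, years), 2))
```

Both functions reproduce the amounts quoted in the text to within rounding of a few cents.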

Consider two options: 


(a) Invest $5500 at the beginning of each year starting at age 25 and stop after
10 years, but keep the total accumulated amount ($86,050.18) in the account
earning 8%/year for the next 30 years. At the end of the next 30 years, at age
65, the amount in the account will be $86,050.18 (1 + 0.08)^30 = $865,893.40
for a total investment of 10($5500) = $55,000.

(b) Start investing $5500 at the beginning of each year beginning at age 35, for
the next 30 years, using the same model as described above. The total amount
at the end of the 30 years, at age 65, will be $672,902.30, based on a total
investment of 30($5500) = $165,000.


With option ‘b’ you invest more ($165,000 versus $55,000) and end up with less
($672,902.30 versus $865,893.40) than with option ‘a’. Of course, investing over
the entire 40 years of your working life will give you a total of
$865,893.40 + $672,902.30 = $1,538,796.

That amount of money may seem like a lot, but will it be enough when you 
retire? At the end of 40 years, the price of what you might want to buy will be 
more than what it is now. For an annual inflation rate (fraction) of f, what you
can buy for a dollar at age 25 will cost (1 + f)^40 dollars 40 years later. If the
inflation rate f is, say, 3% per year, you will need $17,941.21
40 years from now to buy what $5500 could buy today. The message: Needing
money for retirement is real. So is inflation. Hence, how to invest now to be ready 
to retire sometime in the future with the desired lifestyle is worth some thought 
and planning, and as the previous example shows, the sooner the better (Fig. 5.8)! 
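
The effect of inflation on the retirement total can be checked with two one-line functions. This is a sketch for illustration; the function names are assumptions, and the 3% inflation rate is the one used as an example in the text.

```python
def future_price(price_today, inflation, years):
    """Price after `years` of inflation: price_today (1 + f)^years."""
    return price_today * (1 + inflation) ** years


def value_in_todays_dollars(future_amount, inflation, years):
    """Purchasing power today of an amount received `years` from now."""
    return future_amount / (1 + inflation) ** years


needed = future_price(5500, 0.03, 40)
real_value = value_in_todays_dollars(1_538_796, 0.03, 40)
print(f"needed in 40 years to match $5500 today: {needed:,.2f}")
print(f"40-year total expressed in today's dollars: {real_value:,.2f}")
```

The second number shows how much smaller the 40-year total is when measured in what it can buy today.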


Fig. 5.8 Retirement. How much will you need to implement it?
Exercises


1. What is $1 invested today at 7% per year, compounded annually, worth at the 
end of 10 years? 

2. How long will it take to double your investment if it is earning 10% per year?

3. What is the value of $1 invested for a year if compounded at 1% per month? 

4. What would be the answer to the previous question if an annual nominal interest 
rate of 12% were compounded continuously within the year? 

5. Suppose after you graduate and begin receiving an income you start investing 
$6000 each year into a tax-free retirement account that earns 8% per year. You do 
this for only 10 years, and then just leave it in the account earning 8% interest for 
the next 30 years when you decide to retire. Alternatively, you only start investing 
$6000 per year into this tax-free account on the 11th year of employment and 
keep investing annually for the remaining 30 years. Which investment strategy 
will result in a higher retirement fund at the end of 40 years of employment? 

6. How much money are you going to need when you retire to assure you can meet 
your standard of living for the remainder of your life? Specify all the assumptions 
you are making, taking into account taxes and inflation. How are you going to 
get that amount of money (i.e., your savings plan?). 

7. One criterion for plan selection is maximum net annual benefits. The maximum 
benefit-cost ratio, or annual benefits divided by annual costs, is another criterion. 
Benefit—cost ratios should be no less than one if the annual benefits are to exceed 
the annual costs. Consider two projects, I and II: 


Project 

I II 
Annual benefits 20 2 
Annual costs 18 1.5 
Annual net benefits 2 0.5 
Benefit/cost ratio 1.11 1.3 


What additional information is needed before one can determine which 
project is the most economical project? 

8. Bonds are often sold to raise money for infrastructure project investments. Each 
bond is a promise to pay a specified amount of interest, usually semiannually, 
and to pay the face value of the bond at some specified future date. The selling 
price of a bond may differ from its face value. Since the interest payments are 
specified in advance, the current market interest rates dictate the purchase price 
of the bond. 

Consider a bond having a face value of $10,000, paying $500 annually for 
10 years. The bond or ‘coupon’ interest rate based on its face value is 500/10,000, 
or 5%. If the bond is purchased for $10,000, the actual interest rate paid to the 
owner will equal the bond or ‘coupon’ rate. But suppose that one can invest 
money in similar quality (equal risk) bonds or notes and receive 10% interest. 
As long as this is possible, the $10,000, 5% bond will not sell in a competitive
market. In order to sell it, its purchase price has to be such that the actual interest
rate paid to the owner will be 10%. In this case, what is the bond currently worth?

The interest paid by some bonds, especially municipal bonds, may be exempt
from state and federal income taxes. If an investor is in the 30% income tax
bracket, for example, a 5% municipal tax-exempt bond is equivalent to about
a 7% taxable bond. This tax exemption helps reduce local taxes needed to
pay the interest on municipal bonds, as well as provides attractive investment
opportunities to individuals in high tax brackets.

9. Assume a particular university’s tuition and fees are $C today.
Assume the after-tax interest rate you can earn in the next 24 years is 5%.
Assume the inflation rate of tuition and fees in the next 24 years will be 4%.
Show how to determine how much would be enough to invest today to pay
for four years of tuition and fees starting 20 years from now.
Just set up the equations needed to find the answer. Drawing a picture may
help.


10. You must pay back a bank debt, say of $1000, with interest, in 12 equal end-of-month
payments. Each monthly payment contains both some of your debt and
the monthly interest owed on the remaining debt. The bank tells you the annual 
interest rate is 5%. Describe how you could determine the annual interest rate 
you actually paid on the debt you owed. 


11. You are considering taking flying lessons that, if begun today, will cost $10,000.
Alternatively, you could wait a year to begin the lessons after paying the fee
(that is likely to be higher) at that time.

(a) If you decide to wait a year and invest the $10,000 during the year, earning
an annual interest rate i, describe how you would determine the extra money
you would have at the end of the year after paying the inflated cost of lessons
at that time.

(b) Assume you forgot to consider the fact that you will owe income taxes on the 
interest earned. Your income tax rate is t. How would your analysis change 
so as to include the tax payment? 


12. You must pay back a bank debt, say of $1000, with interest, in 3 equal end-of-year
payments, A. Each payment contains the interest on the debt at the beginning
of the year and some of the principal.

(As the debt decreases, so does the interest contained in each successive payment A.
The interest paid, I_y, at the end of a year y is based on the debt, P_{y-1}, at the
beginning of that year.)

The bank tells you the annual interest rate is 5%. 

Show how to compute the principal and interest contained in each of the three 
end-of-year payments ‘A’ using the following steps: 

(a) Write the equation for solving for the payments A:

(b) Show the equation for computing the first interest payment, I_1:

(c) Given A and I_1, show the equation for computing the remaining debt at the
beginning of the 2nd year, P_1:

(d) Show the equation for computing the interest paid in the 2nd payment:

(e) Given A, P_1, and I_2, solve for the remaining debt at the beginning of the 3rd year:

You can deduct 30% of the annual interest payment from your income tax each
year. Given all the interest payments I_y and A, show the equation you could use
to compute the actual interest rate you are paying on your debt.
Solving Models Using Excel


ABSTRACT 


This chapter offers examples illustrating how the ‘Solver’ feature of Excel can 
be used to solve simultaneous equations and unconstrained and constrained opti- 
mization models. This and other features of Excel can be used to solve any of 
the optimization or simulation models or equations introduced in this book. 


6.1 Introduction 


Recall the model developed in Chap. 3 for finding the dimensions of a tank that 
minimized its cost, or the model introduced in Chap. 4 for estimating the most ben- 
eficial way of allocating scarce resources to multiple users. In each case, there were 
multiple possible solutions, and the best solution was not obvious. These situations 
motivate the development of optimization models but the models themselves are 
of little value unless they can be solved. This book introduces ways of developing 
and solving optimization models. Each method has its advantages and limitations, 
as was evident for the hill climbing approach presented in Chap. 4. This chapter 
shows how optimization models can be solved using ‘Solver’ contained in the 
Microsoft spreadsheet program Excel. 

Software programs such as Excel change over time. Hence, what is described 
in this chapter is only an outline of what is needed to be able to use Solver and 
take advantage of other capabilities of Excel when solving optimization models. It 
reflects the version of Excel available when this book was written. This chapter is 
not a substitute for the documents available from Microsoft and others that explain 
Excel’s features in more detail. 
6.2 Using Solver in Excel


To use Excel to solve optimization problems, we need to use ‘Solver’. If it is not
already available under the Data menu item, it must be installed. To do this, find
and click on ‘Options’ under ‘File’. Then find and click on ‘Add-ins’. Then find
and click on ‘Solver Add-in’. Once this ‘Solver Add-in’ line is highlighted, click
on ‘Go’ at the bottom of the page. The dialog box shown in Fig. 6.1 will appear.
Click on the box next to ‘Solver Add-in’ and then ‘OK’. Then you can go
to the ‘Data’ page of Excel, and you should see ‘Solver’ at the far right of the top
row of menu items (Fig. 6.1).

The following examples are used to illustrate how the optimization component 
in Excel works. 


1. Benefit-cost analysis:


Assume a decision variable x can range between 0 and 12. Any value of x will
yield benefits and incur a cost. The benefit function for this decision variable is
80x^0.55. Its cost function is 7 + 4x^1.5. Given these functions, as shown in Fig. 6.2,
the optimization problem is to determine the value of x that maximizes the benefits
less the costs, i.e., the net benefits.

Before entering this optimization model into Excel, we can also include equa- 
tions that define the slopes of the benefit and cost functions associated with any 
value of x. As one can see from Fig. 6.2, when the net benefits are at their max- 
imum value, the slopes of the benefit and cost functions are equal. We can use 
Excel to not only find the best value of x, but also verify that at that value, the 
marginal benefits equal the marginal costs, i.e., the slopes are the same. 

Fig. 6.1 Dialog box used to select Solver to be installed in Excel. The add-ins
listed as available are Analysis ToolPak, Analysis ToolPak - VBA, Euro Currency
Tools, and Solver Add-in (a tool for optimization and equation solving)

Fig. 6.2 Benefit and cost functions together with the net benefit function, plotted
for x from 0 to 12

Using calculus, which will be described later in Chap. 10, we can find the equations
that define the marginal values or slopes of these benefit and cost functions
at any value of x.


marginal benefit = (0.55)(80) x^(0.55 - 1)


marginal cost = (1.5)(4) x^(1.5 - 1)


Next, we can set up the model in Excel: Fig. 6.3 illustrates one way to do this. 

Once the model is entered into the Excel spreadsheet, we can find the optimum 
(maximum net benefit) solution by clicking on the Solver menu item, which again 
is among the menus found under the Data menu. The dialog box shown in Fig. 6.4 
will appear. 

In this example, the cell containing the objective function is F5. It is to be 
maximized. The value of the decision variable x is in cell E6. There are no con- 
straints. The non-linear solver is to be used to find the best solution since the 
model is non-linear. Solver assumes that all unknown variables are non-negative 
unless otherwise specified in the constraint section. 

Clicking on Solve (having the blue border in Fig. 6.4) results in the solution 
shown in Fig. 6.5. 


Benefit Cost Analysis

Benefit Function: 80*x^0.55                       in F3    0
Cost Function: 7+4*x^1.5                          in F4    7
Net Benefits                                      in F5    -7
Variable x                                        in E6    0

Marginal Benefits = (0.55)*(80)*x^(0.55-1)        in H8    #DIV/0!
Marginal Costs = (1.5)*4*x^(1.5-1)                in H9    0


Fig. 6.3 Model for finding Net Benefits entered into an Excel spreadsheet
Solver Parameters

Set Objective: $F$5
To: Max
By Changing Variable Cells: $E$6
Subject to the Constraints: (none)
Make Unconstrained Variables Non-Negative: checked
Select a Solving Method: GRG Nonlinear

Solving Method: Select the GRG Nonlinear engine for Solver Problems that are
smooth nonlinear. Select the LP Simplex engine for linear Solver Problems, and
select the Evolutionary engine for Solver problems that are non-smooth.


Fig. 6.4 Dialog box for identifying the type of optimization, the function to be maximized or
minimized or for just finding any solution, the unknown decision variables in the model, the method
used for optimization, and the constraints, if any


Benefit Cost Analysis

Benefit Function: 80*x^0.55                       in F3    253.5444
Cost Function: 7+4*x^1.5                          in F4    99.96627
Net Benefits                                      in F5    153.5781
Variable x                                        in E6    8.144

Marginal Benefits = (0.55)*(80)*x^(0.55-1)        in H8    17.12273
Marginal Costs = (1.5)*4*x^(1.5-1)                in H9    17.12273


Fig. 6.5 Solution of Benefit-Cost model in which the net benefits, cell F5, is a maximum
Note that the net benefits are a maximum when x is 8.144. At this x value, the
slopes of the benefit and cost functions are the same, namely 17.123. Knowing 
that this condition will always apply, unless constrained otherwise, the value of x 
could have been obtained by simply equating the marginal values and solving for 
x. This would require adding the constraint that equates the two marginal values, 
as illustrated in Fig. 6.6. 

Clicking on Solve in the dialog box shown in Fig. 6.6 will result in the same 
output as shown in Fig. 6.5. 
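
The Solver result can be cross-checked outside Excel. The sketch below is an assumption-laden illustration, not a description of how Solver works: it uses a simple golden-section search (a different technique from Excel's GRG Nonlinear engine) on the interval 0 to 12, and all function names are hypothetical.

```python
def benefit(x):
    return 80 * x ** 0.55


def cost(x):
    return 7 + 4 * x ** 1.5


def net_benefit(x):
    return benefit(x) - cost(x)


def marginal_benefit(x):
    return 0.55 * 80 * x ** (0.55 - 1)


def marginal_cost(x):
    return 1.5 * 4 * x ** (1.5 - 1)


def golden_section_max(f, lo, hi, tol=1e-8):
    """Maximize a single-peaked function f on [lo, hi] by golden-section search."""
    ratio = (5 ** 0.5 - 1) / 2  # about 0.618
    a, b = lo, hi
    while b - a > tol:
        c = b - ratio * (b - a)
        d = a + ratio * (b - a)
        if f(c) < f(d):
            a = c  # the maximum lies in [c, b]
        else:
            b = d  # the maximum lies in [a, d]
    return (a + b) / 2


x_best = golden_section_max(net_benefit, 0.0, 12.0)
print(round(x_best, 3), round(net_benefit(x_best), 4))
print(round(marginal_benefit(x_best), 3), round(marginal_cost(x_best), 3))
```

Equating the two marginal expressions by hand gives the same value, x = (44/6)^(1/0.95), which is a useful check on any numerical search.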


2. Designing a cylindrical tank. 


This second example involves determining the least-cost dimensions of a cylindrical
tank. The design variables are the radius and the height. The known parameters
are the unit (per unit area) costs of the side, top, and bottom areas, the
required volume, and the constant pi (π).

This optimization problem has a constraint requiring the volume to be at least 
equal to 100 units. 


Solver Parameters

To: Value Of: 0
By Changing Variable Cells: $E$6
Subject to the Constraints: $H$8 = $H$9
Make Unconstrained Variables Non-Negative: checked
Select a Solving Method: GRG Nonlinear


Fig. 6.6 Solving the benefit-cost model by simply equating the marginal benefits and costs. This
requires the constraint shown in the constraint section of this dialog box




Circular Tank Design (spreadsheet model, Solver dialog boxes, and Sensitivity Report)

Variables: Radius, Height
Parameters and functions: unit cost, area, and total cost of the side, bottom, and top;
Pi = 3.142857; Required Volume; Actual Vol; Total Cost

Microsoft Excel 16.0 Sensitivity Report
Report Created: 5/12/2020 2:53:09 PM

Variable Cells
Cell    Name      Final Value    Reduced Gradient
$D$4    Radius    2.297613081    0
$D$5    Height    6.027281337    0

Constraints
Name           Final Value    Lagrange Multiplier
Actual Vol=    99.99997576    34.83014671

Solver Results: Solver found a solution. All Constraints and optimality conditions are satisfied.
When the GRG engine is used, Solver has found at least a local optimal solution. When Simplex LP
is used, this means Solver has found a global optimal solution.


Fig. 6.7 Setting up and solving for the least-cost values of the radius and height of a circular tank


Figure 6.7 shows the Excel model and the steps needed to define the objective, 
the decision variables, and the constraint. It also shows how to get the sensitivity 
information related to the constraint, called the Lagrange Multiplier. Its value indi- 
cates the additional cost if the volume were increased by one unit (i.e., the slope 
of the total cost function at the optimal value of the radius and height). It is also 
called the shadow price or dual variable as discussed in Chap. 4. 
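
For this particular model the optimum can also be found by substituting the volume constraint into the cost function. The sketch below is illustrative only. The unit costs of 40, 100, and 5 for the side, bottom, and top are assumptions inferred from the areas and total costs in Fig. 6.7, pi is set to 22/7 to match the spreadsheet value 3.142857, and the function name is hypothetical.

```python
def least_cost_tank(volume, cost_side, cost_bottom, cost_top, pi=22 / 7):
    """Least-cost radius and height of a closed cylinder holding `volume`."""
    cost_ends = cost_bottom + cost_top
    # With h = V / (pi r^2), cost(r) = 2 cost_side V / r + cost_ends pi r^2.
    # Setting d(cost)/dr = 0 gives r^3 = cost_side V / (cost_ends pi).
    radius = (cost_side * volume / (cost_ends * pi)) ** (1 / 3)
    height = volume / (pi * radius ** 2)
    total_cost = cost_side * 2 * pi * radius * height + cost_ends * pi * radius ** 2
    # Slope of the minimum cost with respect to the required volume
    # (the Lagrange multiplier, or shadow price, of the volume constraint).
    shadow_price = 2 * cost_side / radius
    return radius, height, total_cost, shadow_price


# Assumed unit costs (inferred, not stated in the text) and the required volume of 100.
radius, height, total_cost, shadow_price = least_cost_tank(100, 40, 100, 5)
print(round(radius, 4), round(height, 4), round(total_cost, 2), round(shadow_price, 3))
```

Under these assumptions the radius, height, and shadow price agree with the Solver values in Fig. 6.7 to about three significant figures; small differences are to be expected because Solver stops once its convergence tolerance is met.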

The first step is to define the model variables, parameters, and functions in
any way that makes it clear where their values will be shown. This is done in the
upper-left portion of Fig. 6.7, although the values shown there are the ones
obtained after the model was solved. When first setting up the model, most of the
values of the decision variables and functions will be 0.

Once the model is complete, select Solver and fill in the dialog box as shown 
in the upper right of Fig. 6.7. To add a constraint, select the ‘Add’ button in the 
constraint section of the dialog box and another dialog box will appear as shown 
just under the model. After entering the constraint, clicking on OK will make that 
constraint appear in the larger dialog box as shown above. Clicking on ‘Solve’, if
there are no errors, will result in the dialog box shown at the bottom right of the
figure. Selecting ‘Sensitivity’ (as shown in blue) will generate the report shown at
the bottom left of the figure. That report will be on a separate page of the Excel
file. This option will be demonstrated in the next example problem.


3. Resource allocation. 


This example problem is to find the allocations X, Y, and Z to three users that 
maximize the total benefits obtained, given only 6 units of resource available. The 
benefit functions for each use are: 


B1 = 6X - X^2;  B2 = 7Y - 1.5Y^2;  B3 = 8Z - 0.5Z^2.

The objective is to maximize B1 + B2 + B3
subject to: X + Y + Z ≤ 6.


This is the same problem that was used to illustrate the hill climbing approach 
in Chap. 4 for solving models that contain continuous concave objective functions 
for maximization, or convex functions for minimization. Here we use Excel to 
solve the same model. In this case, we can assume each allocation is a continuous 
variable, rather than a discrete variable as was assumed for hill climbing (Figs. 6.8 
and 6.9). 
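
Because each benefit function is concave, the continuous solution can also be found by equating marginal benefits. The sketch below is one way to do that and is not how Solver proceeds; the function names are assumptions. It searches by bisection for the common marginal value (lam) at which the three allocations add up to the 6 units available.

```python
def allocations_for_marginal_value(lam):
    """Allocations at which each marginal benefit equals lam (never negative).

    Marginal benefits: dB1/dX = 6 - 2X, dB2/dY = 7 - 3Y, dB3/dZ = 8 - Z.
    """
    x = max(0.0, (6 - lam) / 2)
    y = max(0.0, (7 - lam) / 3)
    z = max(0.0, 8 - lam)
    return x, y, z


def solve_allocation(available=6.0, tol=1e-10):
    lo, hi = 0.0, 8.0  # at lam = 8 nothing is allocated; at lam = 0 more than 6 is wanted
    while hi - lo > tol:
        lam = (lo + hi) / 2
        if sum(allocations_for_marginal_value(lam)) > available:
            lo = lam  # too much allocated, so the marginal value must be higher
        else:
            hi = lam
    lam = (lo + hi) / 2
    return allocations_for_marginal_value(lam), lam


(x, y, z), lam = solve_allocation()
total = (6 * x - x ** 2) + (7 * y - 1.5 * y ** 2) + (8 * z - 0.5 * z ** 2)
print(round(x, 4), round(y, 4), round(z, 4), round(lam, 4), round(total, 4))
```

The common marginal value found this way plays the same role as the Lagrange multiplier reported in Solver's sensitivity report.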


Resource Allocation (spreadsheet before solving)

Variables (cells D4 to D6): X = 0, Y = 0, Z = 0
Functions: B1, B2, B3, and Total (cell F12)
Resources Available = 6 (cell F13)
Resources Allocated = 0 (cell F14)

Solver Parameters

Set Objective: $F$12
To: Max
By Changing Variable Cells: $D$4:$D$6
Subject to the Constraints: $F$14 <= $F$13
Make Unconstrained Variables Non-Negative: checked
Select a Solving Method: GRG Nonlinear


Fig. 6.8 The resource allocation problem is set up for solution using Solver in Excel
Resource Allocation (spreadsheet after solving)

Variables: X = 1, Y = 1, Z = 4
Functions: B1 = 5, B2 = 5.5, B3 = 24, Total = 34.5
Resources Available = 6
Resources Allocated = 6

Solver Results: Solver found a solution. All Constraints and optimality conditions are satisfied.
When the GRG engine is used, Solver has found at least a local optimal solution. When Simplex LP
is used, this means Solver has found a global optimal solution.


Fig. 6.9 Solution of the resource allocation problem, and dialog box used to access the solution
shown on left and sensitivity reports shown in Fig. 6.10


Fig. 6.10 Sensitivity report associated with the resource allocation model

Microsoft Excel 16.0 Sensitivity Report
Worksheet: [Book1]Sheet1
Report Created: 5/12/2020 7:05:34 PM

Variable Cells
Cell    Name        Final Value    Reduced Gradient
$D$4    X= Value    1.000000008    0
$D$5    Y= Value    0.999999993    0
$D$6    Z= Value    3.999999999    0

Constraints
Cell     Name                               Final Value    Lagrange Multiplier
$F$14    Resources Allocated = Functions    6              3.999998093


6.3 Conclusion 


This chapter and its examples serve just as an introduction to using the Solver 
within Excel to find solutions to simultaneous equations or to constrained or 
unconstrained optimization problems. There is much more to learn besides what 
has been demonstrated here, and some of these additional features will be covered 
as we work through various policy problems introduced in the following chapters. 

Relying on a computer to solve problems does not eliminate the need to think. 
Steve Jobs suggested that programming a computer helps us think, and we assume
the same may apply to using Excel.


“Everybody in this country should learn to program a computer... because it teaches you 
how to think”. 
Steve Jobs, co-founder and CEO of Apple, Inc. (1995-2011)


Exercises 


1. Regression involves finding functions that best fit some observed data. One
criterion is to minimize the sum of squared deviations between observed and predicted
values. Suppose you have a set of observed (known) x, y values, say x(i) and
corresponding y(i).


y(i):   4   10   18   11   22    7   10   14   19    3
x(i):   2    4    8    6   10    3    5    7    9    1


Define and solve an optimization model to determine the parameters of a
non-linear function y = a + bx^c that best fit the above data.


2. Find the four linear functions that best fit the following four sets of data. Then 
plot the data. What does this tell you about fitting functions to data? 
Anscombe's quartet

       I       |       II      |      III      |       IV
   x   |   y   |   x   |   y   |   x   |   y   |   x   |   y
 10.0  |  8.04 | 10.0  |  9.14 | 10.0  |  7.46 |  8.0  |  6.58
  8.0  |  6.95 |  8.0  |  8.14 |  8.0  |  6.77 |  8.0  |  5.76
 13.0  |  7.58 | 13.0  |  8.74 | 13.0  | 12.74 |  8.0  |  7.71
  9.0  |  8.81 |  9.0  |  8.77 |  9.0  |  7.11 |  8.0  |  8.84
 11.0  |  8.33 | 11.0  |  9.26 | 11.0  |  7.81 |  8.0  |  8.47
 14.0  |  9.96 | 14.0  |  8.10 | 14.0  |  8.84 |  8.0  |  7.04
  6.0  |  7.24 |  6.0  |  6.13 |  6.0  |  6.08 |  8.0  |  5.25
  4.0  |  4.26 |  4.0  |  3.10 |  4.0  |  5.39 | 19.0  | 12.50
 12.0  | 10.84 | 12.0  |  9.13 | 12.0  |  8.15 |  8.0  |  5.56
  7.0  |  4.82 |  7.0  |  7.26 |  7.0  |  6.42 |  8.0  |  7.91
  5.0  |  5.68 |  5.0  |  4.74 |  5.0  |  5.73 |  8.0  |  6.89
Discrete Optimization Modeling


ABSTRACT 


Using examples, the chapter introduces discrete dynamic programming that con- 
verts an overall optimization problem into many simpler sub-optimization prob- 
lems. The chapter discusses the advantages and limitations of this optimization 
method. 


7.1 Discrete Dynamic Programming 


When most people read the word ‘programming’ they typically think of computer
programming, creating a set of instructions that tell a computer how to perform a
task. The term ‘mathematical programming’ refers to algorithms (methods), used
by computers or applied manually, for solving constrained optimization problems.
In Chap. 4, the hill climbing method was introduced as an approach for solving
discrete optimization problems. Hill climbing is one of many mathematical
programming methods.
Recall that this method only works if the functions to be maximized are con- 
tinuous and concave, or convex if they are to be minimized. But what if those 
conditions are not satisfied? A mathematical programming method that is avail- 
able for solving discrete optimization problems where the objective functions can 
be discontinuous, and of any shape, is called discrete dynamic programming. 

Dynamic programming is an approach that transforms discrete multi-variable 
multi-stage optimization problems into networks of nodes and links and then 
solves for the best paths through such networks. Stages could be time periods 
or locations or activities. The nodes represent discrete states of the system that 
can exist at each stage either before or after a decision has been made. The links 
connecting those nodes in successive stages represent discrete decisions that are 
feasible, given the state of the system. 
For example, recall the resource allocation problem introduced in previous
chapters. The problem involved finding the allocations of resources to multiple 
users that maximized the total benefits derived from those allocations. Think of 
each user as being at a different location and an allocation decision process that 
proceeds in steps from one user to the next. The first step begins with deciding how 
many resources to allocate to the first user. Then, with the resources remaining, the 
second step involves making an allocation to the second user. Finally, with what 
resources remain, the third step is to make an allocation to the third user. Each 
step is called a stage of the dynamic programming process. The remaining avail- 
able resources are a state of the system, represented by nodes. The links represent 
allocation decisions. A network representation of this process defines all possible 
discrete alternative allocations at each stage to each remaining user. The discrete 
dynamic programming procedure is a way of identifying the best path through this 
discrete network. 
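
The stage, state, and decision structure just described can be sketched in a few lines. This is an illustration under stated assumptions, not the book's procedure: it borrows the three benefit functions and the 6 available units from the Excel resource allocation example in Chap. 6, restricts allocations to whole units, and uses hypothetical names throughout.

```python
from functools import lru_cache

# Benefit functions for users 1, 2, and 3 (assumed, taken from the Chap. 6 example).
BENEFITS = (
    lambda x: 6 * x - x ** 2,
    lambda y: 7 * y - 1.5 * y ** 2,
    lambda z: 8 * z - 0.5 * z ** 2,
)


@lru_cache(maxsize=None)
def best_from(stage, remaining):
    """Best (total benefit, allocations) for users stage, stage+1, ...

    `stage` is the index of the user being served (the stage), and
    `remaining` is the number of resource units still available (the state).
    """
    if stage == len(BENEFITS):
        return 0.0, ()
    best_value, best_plan = float("-inf"), ()
    for allocation in range(remaining + 1):  # each feasible decision is a link
        future_value, future_plan = best_from(stage + 1, remaining - allocation)
        value = BENEFITS[stage](allocation) + future_value
        if value > best_value:
            best_value, best_plan = value, (allocation,) + future_plan
    return best_value, best_plan


total, plan = best_from(0, 6)
print(total, plan)
```

Each (stage, remaining) pair is evaluated only once, which is what keeps the procedure from having to enumerate every combination of allocations.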

Converting an optimization problem into a discrete network of nodes and links 
representing different discrete states and decisions at each stage is the main chal- 
lenge in using dynamic programming. Solving for the sequence of best decisions 
once a network is constructed is relatively easy, as will be shown for the following 
several example optimization problems. 


7.1.1 Traveling Problem 


Figure 7.1 could represent a map showing possible routes from the first state, 
node 1, to the end state, node 10. The problem is to find the best route from node 
1 to 10. In this case, the states are just locations. The links are possible routes 
between two locations in each time step, or stage. The numbers on the links could 
represent travel time, or costs, or some relative measure of benefits. Suppose these 
link numbers represent costs and we wish to minimize the total cost of going from 
location 1 to location 10. Using a dynamic programming procedure, we can do 
this without having to consider all possible combinations of routes from node 1 to 
node 10. 

Fig. 7.1 A dynamic programming network showing nodes as locations, links as routes between 
two successive locations, and stages as the succession of decisions made over time or space 

Fig. 7.2 Results of dynamic programming for finding the best decision at each node at the 
beginning of the last stage, 4. The F(j) values are the total minimum costs of going from node j 
to node 10 

Referring to Fig. 7.1, we cannot immediately see how best to travel from node 
1 to node 10. However, if we could determine the best (cheapest cost to node 10) 
link to take from each node in the network, then it would be easy to determine how 
to go from node 1 to node 10 the cheapest way. Dynamic programming provides 
an efficient way of doing that without the need to look at all possible alternative 
routes. To start the dynamic programming procedure, we can start where the deci- 
sion is obvious, say at nodes 8 and 9, and then work backward, from right to left, 
toward node 1. At each node, we want to determine and record the cheapest way 
to go from that node to node 10. Call F(j) the cheapest cost to go from node j to 
node 10. We also want to keep track of the best decision, or link, at each node j. 

We begin at the last stage by determining how best to travel from node 8 to 
node 10, and from node 9 to node 10. There is only one choice at each of those 
nodes. The results of those decisions are shown in Fig. 7.2. 

Moving backward to the previous stage, stage 3, we can find the minimum total 
cost to go to node 10 from nodes 5, 6, and 7. F(5) = min{5 + F(8), 3 + F(9)} = 
min{5 + 6, 3 + 7} = 10. F(6) = min{7 + F(8), 8 + F(9)} = min{7 + 6, 8 + 7} = 13. 
F(7) = min{2 + F(8), 4 + F(9)} = min{2 + 6, 4 + 7} = 8. We can mark the best 
decision in each case with an arrow, as shown in Fig. 7.3. Keep in mind that the 
F(j) values are the minimum costs to proceed from node j to node 10. 

Note that we cannot compute the values of the minimum costs at each node 
at the beginning of stage 3 without first computing those values for each node at 
the end of stage 3 or equivalently at the beginning of stage 4. The same applies 
to each remaining stage, namely stages 2 and 1. In general, for each node or state 
(location) j at the beginning of a stage that is linked to node k at the end of the 
stage: 


F(j) = minimum over all nodes k {cost of link from j to k + F(k)} 


for each node j in each stage. 
Fig. 7.3 Results of dynamic programming for finding the best decision at all nodes at the 
beginning of the third stage 


In this case, at stage 3, the beginning nodes j are 5, 6, and 7, and the ending 
nodes k are 8 and 9. 

Moving to stage 2, we can compute the minimum cost to go from nodes 2, 3, 
and 4 to node 10 in a similar manner, again denoting the best decision by an arrow. 
Note that these total remaining minimum cost values, F(j), computed for each node j 
at the beginning of each stage can be determined without the need to look beyond 
the stage we are in, because we already know the minimum cost of proceeding 
beyond that stage from each ending node k: 


F(j) = the minimum among all links from node j to nodes k of 
{cost of link from j to k + F(k)} 


Once we know the minimum costs of going from nodes 2, 3, and 4, namely 
F(2), F(3), and F(4), to node 10, we can move backward to the first stage and 
determine the total minimum cost to travel from node 1 to node 10, and the best 
decision to make at node 1 to achieve that minimum total cost, F(1). For this 
example, the minimum total cost is 15. 

Now we can determine the optimal (minimum cost) path just by following the 
arrows beginning at node 1. This path is 1, 4, 5, 9, and 10 for a total cost of 
3 + 2 + 3 + 7 = 15. 
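
The backward recursion above can be sketched in a few lines of Python. This is only an illustrative sketch: the link costs in stages 3 and 4 and on links 1 to 4 and 4 to 5 are those quoted in the text, while all other link costs (and the names `costs`, `solve_backward`, and `trace_path`) are assumed here because Fig. 7.1 is not reproduced. The assumed values were chosen to agree with F(1) = 15.

```python
# costs[j][k] = cost of the link from node j to node k.
# Stage 3 and 4 costs and links 1->4, 4->5 come from the text;
# the remaining stage 1 and 2 costs are ASSUMED for illustration.
costs = {
    1: {2: 4, 3: 5, 4: 3},
    2: {5: 3, 6: 2},
    3: {5: 6, 6: 1, 7: 7},
    4: {5: 2, 7: 6},
    5: {8: 5, 9: 3},
    6: {8: 7, 9: 8},
    7: {8: 2, 9: 4},
    8: {10: 6},
    9: {10: 7},
}


def solve_backward(costs, end):
    """Return F (minimum cost from each node to `end`) and the best link at each node.

    Assumes every link goes from a lower-numbered to a higher-numbered node,
    so visiting nodes in decreasing order works backward through the stages.
    """
    F = {end: 0}
    best = {}
    for j in sorted(costs, reverse=True):
        # F(j) = min over k of {cost of link j->k + F(k)}
        k_best = min(costs[j], key=lambda k: costs[j][k] + F[k])
        best[j] = k_best
        F[j] = costs[j][k_best] + F[k_best]
    return F, best


def trace_path(best, start, end):
    """Follow the recorded best decisions (the arrows) from start to end."""
    path = [start]
    while path[-1] != end:
        path.append(best[path[-1]])
    return path


F, best = solve_backward(costs, end=10)
path = trace_path(best, start=1, end=10)
print(F[1], path)
```

Each node is examined once and only its own outgoing links are compared, which is why the full set of routes never has to be enumerated.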

What has just been demonstrated is how discrete dynamic programming breaks 
down multiple variable optimization problems into many single variable optimiza- 
tion problems. Instead of finding the minimum total cost of traveling from node 
1 to node 10, one could use the exact same procedure for a maximization prob- 
lem where the maximum value at each node is recorded instead of the minimum 
value. Because the problem is discrete, there is no restriction on the shape of any 
cost or benefit or other objective functions. There could be restrictions or con- 
straints limiting the possible decisions or links at any node, and hence only the 
feasible decisions should be included in any dynamic programming network. In 
other words, for this example, going from a beginning node j to an ending node k 
in any stage has to be feasible. 
Fig. 7.4 Results of dynamic programming for finding the best decision at all nodes at the 
beginning of stage 2 


Fig. 7.5 Final stage of dynamic programming approach for finding best decision at node 1 to go 
to node 10 and the route to take 


The sequence of steps shown in Figs. 7.2, 7.3, 7.4 and 7.5 is called a backward 
moving approach for solving a dynamic programming network model. We began 
where we wanted to end up and worked backward, from right to left over each 
state in each successive stage to an initial state where we are before solving the 
model, namely node 1 at the beginning of stage 1. Once we know the best decision 
to make at each node in the network, we can use that knowledge beginning at node 
1 to work our way through the network following the arrows from node to node 
to finally reach node 10. When solving for the best decisions at each node in any 
stage, there is no need to consider any of the link costs in other stages. 


7.1.2 Resource Allocation 


Consider the previously defined resource allocation problem in which 6 resources 
are to be allocated to three users, each resulting in net benefits. Let X be the 
allocation to the first user, user #1. The net benefits are 6X − X², which reach a 
maximum at X = 3. More than that reduces the net benefits. Let Y be the allocation 
to user #2. The net benefits are 7Y − 1.5Y², for a maximum when Y = 7/3. Allocating Z 
to user #3 yields net benefits of 8Z − 0.5Z², for a maximum when Z = 8. The 
sum of all the desired allocations is 13.33. If the available resources are less than 
13.33, solving the following optimization model will identify the allocations that 
maximize the total net benefits. 


Maximize B1(X) + B2(Y) + B3(Z) 
Subject to: 
B1(X) = 6X − X²; B2(Y) = 7Y − 1.5Y²; B3(Z) = 8Z − 0.5Z² 
X + Y + Z ≤ 6. 


Making discrete (e.g., integer) allocations allows us to draw a network of this 
allocation problem such as shown in Fig. 7.6. 

The nodes of Fig. 7.6 represent the amount of resources available for the 
remaining allocations, and the links represent the allocation to a particular user. 
The numbers on the links are the benefits resulting from that decision. The 
problem is to find the best path from the initial node, representing 6 resources 
available to allocate to the three users, to an ending node after making allocations 
to the three users. Since the sum of the allocations the users would like is 13.33, 
more than the 6 available, the final state of the system after all allocations are 
made will clearly be 0. There will not be any unallocated resources in an optimal 
solution. 

Fig. 7.6 Network representing the resource allocation problem with integer allocations 
(allocations X with benefits 6X − X², allocations Y with benefits 7Y − 1.5Y², and allocations Z 
with benefits 8Z − 0.5Z²). The numbers in the nodes are the resources available for subsequent 
allocations. Each link's allocation is the difference between the two node state values. The 
numbers on the links are the benefits gained if that particular allocation decision is taken. 
Missing links are ones clearly not feasible or optimal 

Assuming a backward moving approach, designate Fi(S) as the maximum net 
benefits that can be obtained in the remaining allocations given S resources available 
at the beginning of stage i. Starting at stage 3, we compute all the F3(S) values 
before moving to compute all the F2(S) values, and finally, we compute F1(S = 6). 


$$F_i(S) = \max_{\text{integer allocations} \,\le\, S} \{\text{allocation benefits} + F_{i+1}(S - \text{allocation})\} \quad \text{for all values of } S.$$


We also keep track of the best decision at each node (shown by an arrow). This 
backward moving approach is illustrated in Fig. 7.7. 

The maximum total benefits that can be obtained from allocating the 6 available 
resources is F1(6) = 34.5. The arrows in Fig. 7.7 show that the best allocation to 
user #1 is X = 1, leaving 5 resources; the best allocation to user #2 is then Y = 1, 
leaving 4 resources; and hence the allocation to user #3 is Z = 4. 
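
As a check on the numbers above, the backward recursion can be written out as a short Python sketch. The function and variable names (`backward_dp`, `benefits`, `best`) are illustrative choices, not from the text, and stages are indexed from 0 in the code, so `F[0]` corresponds to F1, `F[1]` to F2, and `F[2]` to F3.

```python
# Net benefit functions for users #1, #2 and #3 as given in the text.
benefits = [
    lambda x: 6 * x - x ** 2,
    lambda y: 7 * y - 1.5 * y ** 2,
    lambda z: 8 * z - 0.5 * z ** 2,
]
TOTAL = 6  # resources available at the beginning of stage 1


def backward_dp(benefits, total):
    """Backward-moving discrete DP with integer allocations.

    F[i][S]    = maximum benefits obtainable from stage i onward with S resources left.
    best[i][S] = allocation achieving F[i][S] (the 'arrow' at that node).
    """
    n = len(benefits)
    F = [[0.0] * (total + 1) for _ in range(n + 1)]  # F[n][S] = 0: nothing left to allocate to
    best = [[0] * (total + 1) for _ in range(n)]
    for i in range(n - 1, -1, -1):          # last stage first
        for S in range(total + 1):          # every possible state
            values = [benefits[i](a) + F[i + 1][S - a] for a in range(S + 1)]
            best[i][S] = max(range(S + 1), key=lambda a: values[a])
            F[i][S] = values[best[i][S]]
    return F, best


F, best = backward_dp(benefits, TOTAL)

# Follow the arrows forward from the initial state to read off the allocations.
S = TOTAL
allocations = []
for i in range(len(benefits)):
    allocations.append(best[i][S])
    S -= best[i][S]

print(F[0][TOTAL], allocations)
```

Changing `TOTAL` to another integer re-solves the problem for a different amount of available resources without changing the recursion.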

Discrete dynamic programming models can often be solved using a forward 
rather than a backward moving approach. In this case, we begin at the initial 
node(s) in the network and for each node find fi(S) = maximum net benefits that 
can be obtained from past allocation decisions given S resources available at the end 
of stage i. All values of f1(S) are computed before moving to compute all of f2(S), 
and finally f3(S = 0), keeping track (e.g., using an arrow) of the best 
decision to get to where you are at the end of a stage. At each node, you are asked 
what is the best node to have come from to get to where you are. This approach 
is illustrated in Fig. 7.8. 

Fig. 7.7 The backward moving approach to solving the resource allocation problem. The 
numbers next to each node are the maximum remaining benefits, Fi(S), and the arrows signify the 
best allocation link given the available resources, the numbers in the nodes. The link benefits in 
stage 3 are the F3(S) values shown next to the nodes at the beginning of stage 3 

Fig. 7.8 Solving the resource allocation problem using the forward-moving approach of 
dynamic programming. The numbers at the bottom of each node represent the maximum benefits 
obtained from previous allocations given the resources remaining, the numbers in the nodes. The 
link (allocation) benefit values are not shown here but are as shown in Fig. 7.6 and used to 
compute the maximum benefits obtainable given the remaining resources to allocate to the 
remaining users 

To backtrack to find the optimal allocations, note that the best allocation to user 
#3 is 4. Therefore, the optimal state to be in at the beginning of that last stage is 
4. This is the same state to be in at the end of stage 2. The arrow into that state 
shows that the best state to come from is state 5. And to get to state 5 at the end 
of stage 1 is to come from state 6. Hence, the best allocation to user #2 is 1, and 
to user #1 is 1, for the same total benefits of 34.5. 
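
The forward-moving approach and the backtracking just described can be sketched the same way. Again this is a minimal illustration under assumed names (`forward_dp`, `came_from`); `f[i][S]` plays the role of fi(S), with `f[0]` representing the initial node holding all 6 resources.

```python
# Net benefit functions for users #1, #2 and #3 as given in the text.
benefits = [
    lambda x: 6 * x - x ** 2,
    lambda y: 7 * y - 1.5 * y ** 2,
    lambda z: 8 * z - 0.5 * z ** 2,
]
TOTAL = 6


def forward_dp(benefits, total):
    """Forward-moving discrete DP.

    f[i][S]         = maximum benefits from the first i allocations, ending with S resources.
    came_from[i][S] = best state at the end of stage i-1 from which to reach S.
    """
    unreachable = float("-inf")
    n = len(benefits)
    f = [[unreachable] * (total + 1) for _ in range(n + 1)]
    came_from = [[None] * (total + 1) for _ in range(n + 1)]
    f[0][total] = 0.0                               # initial node: everything still available
    for i, benefit in enumerate(benefits, start=1):
        for S in range(total + 1):                  # state at the end of stage i
            for prev in range(S, total + 1):        # candidate state at the end of stage i-1
                if f[i - 1][prev] == unreachable:
                    continue
                value = f[i - 1][prev] + benefit(prev - S)
                if value > f[i][S]:
                    f[i][S] = value
                    came_from[i][S] = prev
    return f, came_from


f, came_from = forward_dp(benefits, TOTAL)

# Backtrack from the final state S = 0, asking at each node where it is best to have come from.
S = 0
allocations = []
for i in range(len(benefits), 0, -1):
    prev = came_from[i][S]
    allocations.append(prev - S)
    S = prev
allocations.reverse()

print(f[3][0], allocations)
```

Both directions visit the same network, so they return the same total benefits; they differ only in whether each node records the best link leaving it or the best link entering it.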


7.1.3 Capacity Expansion 


Public works departments are often faced with determining when and how much 
infrastructure capacity to add to meet increasing demands over time. Why is this 
an issue? Why not just add the amount of capacity needed when it is needed? The 
answer is shown in Fig. 7.9. 

The costs of adding additional capacity to meet the increasing demand over time 
are not defined by nice continuous convex functions. If they were, one could just 
add the capacity needed when it is needed and not be concerned with the uncer- 
tainty of future demands and costs. Typical infrastructure capacity cost functions 
have a fixed component and exhibit economies of scale, i.e., decreasing average 
and marginal costs with increasing capacity additions. A fixed cost exists if any 
capacity is to be added, otherwise, it is 0. The more times the capacity is increased 
the greater the sum of fixed costs. Fixed cost is a function of existing capacity 
among other factors. Hence, it makes economic sense to overbuild, that is, to add 
more capacity than is needed, so as to reduce the number of times capacity is to be 
added and to achieve lower average costs. 

Fig. 7.9 Typical demand and cost functions for infrastructure capacity (added capacity 
demanded plotted against time, and cost plotted against added capacity) 

The dilemma of course is that we are not certain of both future demands and 
costs. We will return to that issue later. First, consider an example where it is 
assumed future demands are known and must be met. Meeting the demand is the 
condition most public works departments consider as a constraint. A general capac- 
ity expansion model that can be used to identify least-cost expansion schedules that 
meet future demands can be stated as 

Minimize the present value of future expansion costs subject to meeting future 
demands. 

Let A(t) be the capacity added to the existing capacity K(t) in period t at a cost 
of Ct(K(t), A(t)) that is to be paid at the beginning of period t. Let r(t) be the 
discount rate in period t, D(t) the capacity demanded by the end of period t, and n 
the number of time periods being considered. A basic capacity expansion model 
(assuming no capacity decay over time) can be written as 


$$\text{Minimize } \sum_{t=1}^{n} C_t(K(t), A(t)) \, \frac{1}{(1 + r)^{t-1}}$$

Subject to:

$$K(0) = \text{existing capacity at beginning of period 1}$$

$$K(t) + A(t) = K(t+1) \ge D(t), \quad t = 1, 2, \ldots, n$$


The data needed to solve a discrete example of this model are specified in Table 
7.1. 

The capacity expansion problem whose data are shown in Table 7.1 can be 
solved using discrete dynamic programming. It assumes 4 construction periods of 
5 years each. It provides estimates of the present value of the costs of additional 
capacity needed at the end of each 5-year period for the next 20 years. 

The discrete options in the first 5-year period are to add either 2, 4, 6, 8, or 10 
units of capacity. In period 2, one can add any discrete even amount of capacity 
up to a total capacity of 10 units. Hence, if the capacity at the beginning of period 2 
is 2, at least 4 and at most 8 units can be added. And so on to the last period, which 
must have an initial capacity of at least 8; if it is 8, two units can be added to reach 
10 units total. 

Table 7.1 Data showing future demands and costs of a capacity expansion problem 
(columns: period; years; discounted costs of adding 2, 4, 6, 8, or 10 units of additional 
capacity; total additional capacity required at end of period) 

The dynamic programming network for this example problem is shown in 
Fig. 7.10. 

Solving this problem, using either a backward or forward-moving approach, 
will result in two different least-cost solutions, each with a total present value of 
26. The added capacities in successive construction periods are either 10, 0, 0, 0 or 
6, 0, 4, 0. Which is better and why? They both cost 26, so the decision has to be 
based on other criteria. 
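
A hedged sketch of how this network could be solved by a backward recursion follows. The discounted costs are taken from the table in Exercise 2 at the end of this chapter, which is assumed here to hold the same data as Table 7.1. The required capacities (2, 6, 8, and 10 units at the end of periods 1 to 4) are assumptions inferred from the feasible options described in the text, and adding no capacity is assumed to cost nothing. The names `COST`, `REQUIRED`, and `least_cost` are illustrative.

```python
from functools import lru_cache

# COST[t][a] = present value of adding a units of capacity in period t
# (values from the Exercise 2 table; assumed to match Table 7.1).
COST = {
    1: {2: 12, 4: 15, 6: 18, 8: 23, 10: 26},
    2: {2: 8, 4: 11, 6: 13, 8: 15},
    3: {2: 6, 4: 8},
    4: {2: 4},
}
# ASSUMED total capacity required at the end of each period.
REQUIRED = {1: 2, 2: 6, 3: 8, 4: 10}
MAX_CAPACITY = 10
LAST_PERIOD = 4


@lru_cache(maxsize=None)
def least_cost(t, K):
    """Return (minimum cost, list of optimal schedules) from period t onward,
    given capacity K at the beginning of period t."""
    if t > LAST_PERIOD:
        return 0, [()]
    best_cost, best_schedules = float("inf"), []
    for a in range(0, MAX_CAPACITY - K + 1, 2):     # even additions only
        if K + a < REQUIRED[t]:
            continue                                # demand at end of period t not met
        if a > 0 and a not in COST[t]:
            continue                                # no cost estimate for this addition
        step_cost = COST[t][a] if a > 0 else 0
        future_cost, schedules = least_cost(t + 1, K + a)
        total = step_cost + future_cost
        if total < best_cost:
            best_cost = total
            best_schedules = [(a,) + s for s in schedules]
        elif total == best_cost:
            best_schedules = best_schedules + [(a,) + s for s in schedules]
    return best_cost, best_schedules


total_cost, schedules = least_cost(1, 0)
print(total_cost, schedules)
```

Keeping every tied schedule, rather than only the first one found, is what exposes the two equal-cost expansion plans that then have to be compared on other criteria.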


Fig. 7.10 A network representation of the capacity expansion problem defined in Table 7.1 
(capacity on the vertical axis, construction periods 1 to 4 on the horizontal time axis). 
Links represent possible discrete feasible capacity expansion alternatives given the existing 
capacity at the beginning of each construction period. The numbers on the links are the present 
values of the costs of expansion 
How should we deal with the uncertainty of future demands and costs? How 
should we deal with the assumed time horizon of 4 periods, as clearly there is a 
future after that time in which additional capacity may be needed? In other words, 
how should we use a model like the one just presented? 

Perhaps the answer to these questions will become clearer by asking another 
question. What should we do after implementing the first period’s decision, A(1)? 
Wait five years and then refer to this model’s output and implement the second 
decision, A(2)? Obviously not. Conditions may have changed and there are new 
estimates of future costs, demands, interest rates, and time horizons. 

What is of interest when using a model such as this one is what to do now. How 
do the assumed time horizon and the estimates of future demands, costs, and 
interest rates impact this first decision, A(1)? If they do not, one can be more 
confident in the robustness of that first decision, at least with respect to the assumed 
objective, which in this case is minimizing the present value of the total cost. 


7.2 Conclusions 


Dynamic programming like all optimization methods has its advantages as well 
as limitations. It is well suited to address optimization problems which can be 
viewed as having to make a sequence of decisions and in which there are only a 
limited number of state variables and their discrete values, such as existing capac- 
ity, or resources available to allocate, in the examples just discussed. It is not 
dependent on the form of the objective function as are other methods previously 
discussed. While network representations of the dynamic programming optimiza- 
tion problems were used in this chapter to illustrate the two solution approaches, 
mathematical recursion equations can be created for finding the best decisions at 
each state (node) in each successive stage of a problem. These equations can be 
incorporated into a spreadsheet and would be used for solving larger problems 
than those presented in this chapter. These equations will be developed for solving 
more complex problems presented in later chapters (Fig. 7.11). 


Fig. 7.11 The shortest distance problem. User: Dcoetzee, Wikimedia Commons CC0 1.0 
Universal Public Domain Dedication 
1. Consider the allocation problem of allocating resources to three users. The 
allocations are X, Y, and Z. User 1 total revenue is 6X − X². User 2 total revenue 
is 7Y − 1.5Y². User 3 total revenue is 8Z − 0.5Z². The goal is to determine the 
values of X, Y, and Z that maximize {6X − X² + 7Y − 1.5Y² + 8Z − 0.5Z²} given 
6 units of resources available. 

Show how to solve this allocation problem using discrete dynamic program- 
ming with integer allocations. Show how the dynamic programming network 
would be modified to be able to consider 8 integer resources as well as 6 
resources to allocate to the three users having the same net benefit (total return) 
functions. What would the integer allocations and total returns be given 8 avail- 
able resources? Show how this can be solved using the forward-moving and 
backward-moving approaches. 

To show that DP was used, show all F(S) values for each node representing 
a state S, and the best decision (arrow or heavy line) if more than one possible 
decision. 

2. (a) Using dynamic programming (network), solve the following capacity 
expansion problem for the next 20 years (four 5-year construction periods) using 
forward and backward moving approaches. 

The following table provides estimates for the costs of additional water 
treatment plant capacity needed at the end of each 5-year period for the next 
20 years. Find the capacity expansion schedule that minimizes the present 
values of the total future costs. If there is more than one least-cost solution, 
indicate which one you think is better, and why. 


                   Discounted costs of additional capacity     Total additional 
                   (units of additional capacity)              capacity required 
Period   Years       2     4     6     8    10                 at end of period 
1        1-5        12    15    18    23    26 
2        6-10        8    11    13    15 
3        11-15       6     8                                    8 
4        16-20       4                                          10 


Note: The discrete options in the first 5-year period are to add 2, 4, 6, 8 
or 10 units of capacity. In period 2, one can add any discrete even amount of 
capacity up to a total capacity of 10 units so if the beginning period capacity 
is 2 at least 4 and at most 8 units can be added. And so on to the last period 
which must have an initial capacity of at least 8, and if so only two units can 
be added to reach 10 units total. 
(b) The cost in each period t must be paid at the beginning of the period. What 
was the discount factor used to convert the costs at the beginning of each 
period t (say C(t)) to present value (or discounted) costs shown above? In 
other words, how would a cost at the beginning of period t be discounted 
to the beginning of period 1, given an annual interest rate of r? (Only the 
algebraic expression of the discount factor is asked, not the numerical value 
of r.) 

(c) How would you deal with the uncertainty of future demands and costs? In 
other words, how would you use a model like the one you developed? 

3. Water Quality Management Model: 

Find the wastewater treatment efficiencies at sites 1 and 2 that meet stream quality 
standards at sites 2 and 3 at a total minimum cost. Currently, there is no treatment. 
All the wastewater is discharged into the stream. 


Current Pollutant Concentrations: 58 mg/l 95 mg/l 
Maximum Allowable Concentrations: 18 mg/l 23 mg/l 


Available Data: 

Streamflow = 1000 m³/day at all sites. (1 kg/day)/(1000 m³/day) = 1 mg/l. 

Fraction of waste discharged into the stream at site 1 that reaches site 2: 0.25. 

Fraction of waste discharged at site 1 that reaches site 3: 0.15. 

Fraction of waste at and discharged into the stream at site 2 that reaches site 
3: 0.60. 

Limits of treatment: removal of 30 % required, but no more than 90%, for both 
sites. The initial concentration just upstream of site 1 is 32 mg/l. 

The marginal cost of treatment at site 1 is 30 over the range of possible treatment 
fractions. 

The marginal cost of treatment at site 2 is 20 over the range of possible treatment 
fractions. 

Find the least-cost solution that meets the quality standards using dynamic 
programming. 

4. Blueberries 

There are three farmers' markets that sell organically and locally grown 
blueberries. The farmer who grows these blueberries gets 90% of the income from 
their sales; the markets get the other 10%. The demand for blueberries differs in 
each market. Some smart economist has determined that the demand (unit price) 
functions for blueberries at the three markets (m = 1, 2, 3) are 6/(1 + Q1), 
7/(1 + 1.5Q2), and 8/(1 + 0.5Q3), respectively. 
Figure: Demand functions for blueberries (unit price plotted against the quantity Qm 
available at market m). 


At each market m, the unit price varies each week depending on the amount 
of blueberries available, Qm, to be sold. How should the farmer distribute a crop 
ranging from 1 to 6 bushels of blueberries each week to maximize the total amount 
of income received from all three markets? 

Solve for the maximum revenue obtainable from a total of 6 bushels using 
discrete dynamic programming, assuming integer allocations. Use both backward 
and forward approaches. Show your work on a network, not just the solution. 
Linear Optimization Modeling 


ABSTRACT 


The chapter introduces linear programming, arguably the most used optimiza- 
tion method applicable when all the model terms are linear. Graphical solution 
approaches to solve two-variable linear models are used to illustrate how linear 
programming algorithms solve models containing many more variables as are 
typical of most real-world problems. 


8.1 Introduction 


Undoubtedly the most commonly used of all the mathematical programming (con- 
strained optimization) methods is linear programming. Developing and solving 
linear optimization models is often the first topic addressed in courses in sys- 
tems analysis. This is not because the world is linear, but because the algorithms 
(solution methods) used to solve linear models are so efficient and are able to 
solve problems with many—even thousands—of variables and constraints, as long 
as they are linear. Thus, many tricks exist for making non-linear functions linear. 
They are often employed just because of the efficiency and widespread availability 
of the solution methods for linear models. Linear programming has found many 
applications in the military, in government agencies, industry and in agriculture, 
ecology, economics, engineering, public health, and urban planning to mention 
only a few subject areas. 

Hence, it seems reasonable to show how linear problems are solved, at least 
graphically, and when necessary, how some non-linear components of a model 
may be made linear to take advantage of linear optimization solution methods. 
Fig. 8.1 A plot of the three constraints of the linear model defines the region of 
feasible solutions for X and Y 


If a model is linear and has only two variables such as 


maximize X + Y 
subject to: 2X + Y ≤ 4, 
X ≥ 0, Y ≥ 0 


a method for solving linear programming models can be illustrated graphically. 
The first step is to find the region of values of X and Y that satisfy all the con- 
straints. This region is called the feasible region. The combinations of X and Y 
values in this region meet all the constraints. They are called feasible solutions. 
This region of feasible solutions can be shown by a plot of each constraint on a 
graph whose axes are the two variables. In this case, there are three constraints 
(Fig. 8.1) 


2X + Y ≤ 4, X ≥ 0, Y ≥ 0. 


All the X Y pairs of values in the shaded region and its boundaries, called the 
feasible region, satisfy all the constraints. Optimization problems that do not have 
feasible regions have no feasible solutions, meaning that not all constraints can 
be satisfied. Unbounded feasible regions result from one or more variables going 
to infinity, as would be the case if there were no constraint 2X + Y ≤ 4 or if the 
left-hand side of that constraint had to be greater than or equal to 4 or any other number. 

To find the best combination of X and Y values in this feasible region, set the 
objective function equal to some value, such as X + Y = 2, and then plot that 
equation. This is shown as a dashed line in Fig. 8.2. Since that function is to be 
maximized, our goal is to find the maximum value of its right-hand side while 
some part of that function is in the feasible region or on its boundary. Changing 
the right-hand side moves the objective function, the dashed line, up and down but 
doesn’t change its slope. If we change the 2 to a 4, we get the dash-dot line shown 
in that same figure. This is as high as the line can be raised, i.e., as large a value 
as the right-hand side of that objective function can be, while some part of that 
function is in or on the boundary of the feasible region. 

Fig. 8.2 Finding the optimal solution to the linear model by moving the objective 
function that is to be maximized to its highest value position while it still is 
in or on the boundary of the feasible region 

This plot shows that the optimal values of X and Y are 0 and 4, respectively. For 
all continuous variable linear optimization problems that have an optimal solution, 
an optimal solution will be at one of the corner points of the feasible region. Thus, 
computer programs solving such linear optimization models need only to compute and 
compare the objective function values at the corner points (intersections of 
constraints) of the feasible region, rather than searching among the infinity of 
feasible solutions within the feasible region. Furthermore, once a corner point 
produces an objective value greater than that of all immediately adjacent corner 
points, the search for the best solution can end. No more corner points need to be 
considered. Some of you may be interested in thinking about why this is true. 
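
The corner-point idea can be illustrated with a small Python sketch that simply enumerates every intersection of two constraint lines, keeps the feasible ones, and compares their objective values. This brute-force enumeration is only an illustration for a bounded two-variable model; it is not the algorithm that Solver or other linear programming codes use, and the names `corner_points`, `constraints`, and `objective` are assumptions made for this example.

```python
from itertools import combinations

# Each constraint is (a, b, c), meaning a*X + b*Y <= c.
constraints = [
    (2.0, 1.0, 4.0),    # 2X + Y <= 4
    (-1.0, 0.0, 0.0),   # X >= 0, written as -X <= 0
    (0.0, -1.0, 0.0),   # Y >= 0, written as -Y <= 0
]
objective = (1.0, 1.0)  # maximize X + Y


def corner_points(constraints, tol=1e-9):
    """Return the feasible intersections of all pairs of constraint boundaries."""
    points = []
    for (a1, b1, c1), (a2, b2, c2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if abs(det) < tol:
            continue                      # parallel boundaries never intersect
        x = (c1 * b2 - c2 * b1) / det     # Cramer's rule for the 2x2 system
        y = (a1 * c2 - a2 * c1) / det
        if all(a * x + b * y <= c + tol for a, b, c in constraints):
            points.append((x, y))
    return points


def value(point):
    return objective[0] * point[0] + objective[1] * point[1]


corners = corner_points(constraints)
best_point = max(corners, key=value)
best_value = value(best_point)
print(best_point, best_value)
```

For this model the feasible corners are (0, 0), (2, 0), and (0, 4), so only three objective values need to be compared.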

Even though computer programs (e.g., Solver in Excel) will always produce cor- 
ner point solutions it is possible that there are multiple optimal solutions, other than 
corner point ones, when the objective function has the same slope as one of the bind- 
ing constraints. In this example, if we were maximizing 2X + Y, any combination of 
non-negative X and Y values in which 2X + Y = 4 would maximize that function. 

Before leaving this problem, it should be obvious that if this objective were to 
be minimized, the optimal solution would result when the objective line in the plot 
would be lowered until it went through the origin of the plot where X = Y = 0. 


8.2 Dual Variables 


Of interest to many using optimization models involving constraints is the sensi- 
tivity of the objective function value to changes in bounds of those constraints. In 
this model, the upper bound on the constraint 2X + Y is 4. With 4 as an upper 
bound on that constraint, the maximum value of the objective function X + Y is 4. 
If the upper bound were 5 instead of 4, the maximum value of the objective func- 
tion would be 5, an increase of 1. Similarly, if 4 were decreased to 3, the objective 
function value would decrease by 1. This change in the objective function per unit 
change in the bound on the constraint is called the shadow price or dual variable 
or Lagrange multiplier associated with that constraint. It signifies the change in 
the objective function value associated with a unit change in the upper or lower 
bound of the constraint. 
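
One way to see a shadow price numerically is to re-solve the model with the bound changed by one unit and compare the optimal objective values. The sketch below does this for the two-variable model above by enumerating corner points; the helper names `max_objective` and `model` are hypothetical, and the approach assumes a bounded feasible region.

```python
from itertools import combinations


def max_objective(objective, constraints, tol=1e-9):
    """Largest objective value over the feasible corner points of a bounded
    two-variable model. Each constraint is (a, b, c), meaning a*X + b*Y <= c."""
    best = float("-inf")
    for (a1, b1, c1), (a2, b2, c2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if abs(det) < tol:
            continue
        x = (c1 * b2 - c2 * b1) / det
        y = (a1 * c2 - a2 * c1) / det
        if all(a * x + b * y <= c + tol for a, b, c in constraints):
            best = max(best, objective[0] * x + objective[1] * y)
    return best


def model(bound):
    """Constraints 2X + Y <= bound, X >= 0, Y >= 0."""
    return [(2.0, 1.0, bound), (-1.0, 0.0, 0.0), (0.0, -1.0, 0.0)]


objective = (1.0, 1.0)  # maximize X + Y
base = max_objective(objective, model(4.0))
shadow_price_up = max_objective(objective, model(5.0)) - base
shadow_price_down = base - max_objective(objective, model(3.0))
print(base, shadow_price_up, shadow_price_down)
```

Both differences equal 1, matching the change in the objective described in the text for a unit increase or decrease in the bound.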

For any linear or non-linear model containing a vector of decision variables X 
and m constraints of the form 


Maximize or Minimize F(X) 
Subject to: gi(X) ≤ or ≥ bi for i = 1, 2, ..., m. 


each dual variable of each constraint i signifies the change in the optimal value of 
the objective function, F(X), given a unit change in the value of bi. It is the slope 
of the optimal objective function value with respect to bi when the constraint 
equals bi. For non-linear models, the shadow price of any binding 
constraint i changes as bi changes. Hence, the shadow price applies for only small 
changes in bi relative to the value of bi. For linear models, the range of change in 
bi for which the value of the shadow price applies can be larger and will depend 
on the particular model. 

Computer programs, such as Solver in Excel, used to solve optimization models 
not only provide the optimal values of the decision variables X, assuming they 
exist, but also the values of the shadow prices (also called dual variables, dual 
prices, or Lagrange multipliers) associated with each constraint i. Again, these 
values are based on a unit change in the value of each bi. For linear models, 
the output of Solver also specifies the range of each bi value over which its dual 
variable value applies (see Chap. 6). 


8.3 A Production Model 


Suppose for a community fundraising project, two products are to be produced, 
Product A and Product B. Each product is offered for sale for $60 and $80, respec- 
tively. Each product takes one unit of wood and the total amount of wood available 
is 80. Making each Product B requires 2 h of labor, half of what product A requires 
to make, and the total amount of labor hours available is 280. Desired are the 
amounts of Product A, denoted as A and Product B, denoted as B, that maximize 
the total income (Fig. 8.3). 
This optimization problem can be expressed as 


Maximize income = 60A + 80B 
Subject to: 
Material Constraint: A + B ≤ 80 
Labor Constraint: 4A + 2B ≤ 280 
Non-negativity Constraints: A ≥ 0, B ≥ 0 


Since this is another two-variable problem, we can solve it graphically 
(Fig. 8.4). 

As one can see from this plot, only two of the four constraints are binding, 
namely A + B ≤ 80 and A ≥ 0, meaning that they hold as equalities at the optimal 
solution (A = 0, B = 80). Thus, the dual variable value of the labor constraint is 0. 
Having more labor doesn't increase income. But having another unit of wood, in this 
case, would increase income by $80. As seen from this plot, this rate of change of 
$80 per unit of wood would apply from 0 up to a supply of wood of 140. After that, 
labor would be limiting the total obtainable income. If we were forced to produce a 
unit of A, we would have to produce one less B and the maximum total income would 
decrease by 80 − 60 = 20. This is called the 'reduced cost' of A. Nonzero reduced 
costs apply only to variables whose optimal values are 0. 

Fig. 8.3 Putting things together after determining how much wood and labor 
are available. (Public domain. Bureau of Labor Statistics (BLS)) 
www.bls.gov/ooh/production/woodworkers.htm 

Also evident from Fig. 8.4 is that if the coefficient of A, 60, in the objective 
function, 60A + 80B, increased by 20, or if the coefficient of B decreased by 20, 
then any non-negative values of A and B that summed to 80 and satisfied the labor 
constraint (A no greater than 60) would be optimal. Any additional changes until 
the coefficient of A is twice that of B would result in an optimal solution where 
the two constraints intersect. At this point, A is 60 and B is 20. Beyond that, 
the optimal solution would be at A = 70 and B = 0. 
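The graphical argument can be checked numerically. The sketch below is only an illustration, not the book's method: it assumes that a two-variable linear model can be solved by testing every intersection of two constraint boundaries, and the function names `solve_2var_lp` and `production` are made up for this example. Re-solving with one more unit of wood, or with A forced to be at least 1, recovers the shadow price and the reduced cost discussed above.

```python
from itertools import combinations


def solve_2var_lp(c, constraints):
    """Maximize c[0]*A + c[1]*B, where each constraint (a, b, rhs)
    means a*A + b*B <= rhs. Tests every corner of the feasible region."""
    best_value, best_point = None, None
    for (a1, b1, r1), (a2, b2, r2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if abs(det) < 1e-12:
            continue  # parallel boundary lines do not intersect
        # Cramer's rule for the intersection of the two boundary lines
        A = (r1 * b2 - r2 * b1) / det
        B = (a1 * r2 - a2 * r1) / det
        if all(a * A + b * B <= rhs + 1e-9 for a, b, rhs in constraints):
            value = c[0] * A + c[1] * B
            if best_value is None or value > best_value:
                best_value, best_point = value, (A, B)
    return best_value, best_point


def production(wood=80, labor=280, min_a=0):
    """The production model, with adjustable wood, labor, and a floor on A."""
    constraints = [
        (1, 1, wood),     # A + B <= wood
        (4, 2, labor),    # 4A + 2B <= labor
        (-1, 0, -min_a),  # A >= min_a
        (0, -1, 0),       # B >= 0
    ]
    return solve_2var_lp((60, 80), constraints)


income, plan = production()
print("income:", income, "plan (A, B):", plan)
print("shadow price of wood:", production(wood=81)[0] - income)
print("reduced cost of A:", income - production(min_a=1)[0])
print("gain from wood beyond 140:",
      production(wood=141)[0] - production(wood=140)[0])
```

Corner enumeration is practical only for very small models; it is used here because it mirrors the plot, in which the optimum lies at a corner of the feasible region.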


Fig. 8.4 Graphical solution to the production model, showing the constraints 
A + B ≤ 80 and 4A + 2B ≤ 280 and the feasible region 


8.4 Crop Production 


Each year farmers have to decide what crops to grow, where, and how much. 
Assume a farmer can grow three types of grains (Fig. 8.5). The farmer wants to 
determine how much of each type of grain to grow taking into account the labor, 
land and water resource requirements, the resources available, and the incomes 
per hectare of each crop. The resource requirements for each crop, the available 
resources, and the incomes per hectare of each crop are given in Table 8.1. 

Letting the decision variables be the number of hectares of each crop, denoted 
as Corn, Wheat, and Oats, an optimization model for finding the hectares of each 
crop that maximize total income can be written as 


Maximize total income: 400 Corn + 200 Wheat + 250 Oats 
Subject to: 

Corn + Wheat + Oats ≤ 625                 land constraint 

3 Corn + Wheat + 1.5 Oats ≤ 1000          water constraint 

0.8 Corn + 0.2 Wheat + 0.3 Oats ≤ 300     labor constraint 


Fig. 8.5 Harvesting a grain crop from farmland. CC BY-SA 3.0. 
https://en.wikipedia.org/wiki/Harvest#/media/File:Agriculture_in_Volgograd_Oblast_002.JPG 


Table 8.1 Data required to determine how much of each grain crop to grow to maximize total 
income 

Resource         Max. available   Corn   Wheat   Oats   Units 
Water            1000/week        3.0    1.0     1.5    units/week/hectare 
Labor            300/week         0.8    0.2     0.3    hours/week/hectare 
Land             625 hectares 
Yield (income)                    400    200     250    $/hectare 

The non-negativity constraints will obviously be satisfied and hence need not 
be included in the model. 
Using a computer to solve this model, one solution is 


Maximum objective value: 162500.0 


Variable Value Reduced cost 
Corn 187.5000 0.000000 
Wheat 437.5000 0.000000 
Oats 0.000000 0.000000 
Constraint Slack or surplus Dual variable 
Land 0.000000 100.0000 
Water 0.000000 100.0000 
Labor 62.50000 0.000000 


This solution shows that both land and water limit how much the farmer can 
grow. The dual variable values show that if the farmer could add one more unit of 
water, or land, the income would increase by $100. In addition, the solution shows 
no oats being grown, yet forcing a unit of oats to be grown does not reduce the 
total income, as indicated by its ‘reduced cost’ of 0. This suggests that there are 
multiple optimal solutions, i.e., different values of corn, wheat, and oats that give 
the same maximum total income. 

For example, if a constraint were added forcing the hectares of oats to be 100, 
the solution becomes 


Objective value: 162500.0 


Variable Value Reduced cost 
Corn 162.5000 0.000000 
Wheat 362.5000 0.000000 
Oats 100.0000 0.000000 
Constraint Slack or surplus Dual variable 
Land 0.000000 100.0000 
Water 0.000000 100.0000 
Labor 67.50000 0.000000 
Oats required 0.000000 0.000000 


The range of optimal solutions is shown in the sketch below (Fig. 8.6). 

Fig. 8.6 Different combinations of each crop that produce the same maximum 
income 
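The claim of multiple optimal solutions can be checked by direct arithmetic. The sketch below, with illustrative names such as `blend` that are not from the text, evaluates the income and the unused resources for the two reported crop plans and for mixtures of them. It assumes only the data of Table 8.1 and the two solutions printed above.

```python
INCOME = {"corn": 400, "wheat": 200, "oats": 250}  # $/hectare
USE = {  # resource: (use per hectare of each crop, amount available)
    "land":  ({"corn": 1.0, "wheat": 1.0, "oats": 1.0}, 625),
    "water": ({"corn": 3.0, "wheat": 1.0, "oats": 1.5}, 1000),
    "labor": ({"corn": 0.8, "wheat": 0.2, "oats": 0.3}, 300),
}


def income(plan):
    """Total income of a crop plan given in hectares."""
    return sum(INCOME[crop] * plan[crop] for crop in plan)


def slack(plan):
    """Unused amount of each resource (negative would mean infeasible)."""
    return {
        name: limit - sum(coef[crop] * plan[crop] for crop in plan)
        for name, (coef, limit) in USE.items()
    }


def blend(p, q, weight):
    """Weighted average of two plans: (1 - weight) * p + weight * q."""
    return {crop: (1 - weight) * p[crop] + weight * q[crop] for crop in p}


no_oats = {"corn": 187.5, "wheat": 437.5, "oats": 0.0}
with_oats = {"corn": 162.5, "wheat": 362.5, "oats": 100.0}

for weight in (0.0, 0.25, 0.5, 1.0):
    plan = blend(no_oats, with_oats, weight)
    print(weight, plan, income(plan), slack(plan))
```

Every mixture uses all the land and water and earns the same income, which is what a zero reduced cost for oats implies.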
8.5 Police Scheduling 


A community has minimum requirements for the number of police (Fig. 8.7) that 
need to be on duty during each 4-h period. These requirements are shown in Table 
8.2. The actual number on duty cannot be less than that. Each police officer 
works 8 consecutive hours per day. (For simplicity assume no days off.) There are 
no part-time police, and union regulations prohibit split shifts. The problem is to 
find a daily schedule that employs the fewest police officers. Table 8.2 
also defines the variables, xt, used to represent the number of police who begin 
their work at hour t. 


Fig. 8.7 Police providing a public service. Creative Commons Attribution 2.0 
Generic license. https://commons.wikimedia.org/wiki/File:Beijing_Police_is_helping.jpg 


Table 8.2 Data required to determine how many police to hire in each 4-h period of a 
day 

Time period        Variable   Police required 
Noon-4 pm          x12        10 
4 pm-8 pm          x16        25 
8 pm-midnight      x20        30 
Midnight-4 am      x0         40 
4 am-8 am          x4         10 
8 am-noon          x8         15 

The objective is to find the minimum total number of police needed to be hired 
throughout the day. 


Minimize x0 + x4 + x8 + x12 + x16 + x20 
Subject to the requirements for each 4-h period during the day: 


Period 0000-0400    x20 + x0 ≥ 40 
Period 0400-0800    x0 + x4 ≥ 10 
Period 0800-1200    x4 + x8 ≥ 15 
Period 1200-1600    x8 + x12 ≥ 10 
Period 1600-2000    x12 + x16 ≥ 25 
Period 2000-2400    x16 + x20 ≥ 30 


One solution is as shown in Table 8.3. 

Once again, zero ‘reduced costs’ for variables x8 and x16, whose values are 
0, suggest that there are many optimal solutions requiring a total police force of 
80. Hence, it is possible to alter the times some police start their shifts to better 
satisfy other personnel or police department objectives, if any, without requiring 
more police. For example, if it were desired to minimize the maximum shift size 
while also minimizing the total number of police needed, one solution is given in 
Table 8.4. 

This policy tends to reduce the variation in the number of police beginning their 
work in each time period. 
Table 8.3 An optimal solution to the police scheduling problem, requiring 80 police 

Variable      Value              Reduced cost 
x0            10                 0 
x4            15                 0 
x8            0                  0 
x12           25                 0 
x16           0                  0 
x20           30                 0 

Constraint    Slack or surplus   Dual variable 
0000-0400     0                  −1 
0400-0800     15                 0 
0800-1200     0                  −1 
1200-1600     15                 0 
1600-2000     0                  −1 
2000-2400     0                  0 


Table 8.4 Another optimal solution to the police scheduling problem, requiring 80 police 

Variable      Value 
x0            20 
x4            5 
x8            10 
x12           15 
x16           10 
x20           20 
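Both schedules can be verified without a solver. The sketch below is an illustration with made-up names (`on_duty`, `is_feasible`). It checks that each schedule covers every 4-h requirement, and it computes a simple lower bound on the force size: the periods beginning at 0000, 0800, and 1600 are staffed by three disjoint pairs of start times, so their requirements must add.

```python
# Minimum police required in each 4-h period, keyed by the period's start hour
REQUIRED = {0: 40, 4: 10, 8: 15, 12: 10, 16: 25, 20: 30}


def on_duty(starts):
    """Police on duty in each period: those starting then plus those who
    started 4 h earlier and are in the second half of an 8-h shift."""
    return {t: starts[t] + starts[(t - 4) % 24] for t in REQUIRED}


def is_feasible(starts):
    duty = on_duty(starts)
    return all(duty[t] >= REQUIRED[t] for t in REQUIRED)


table_8_3 = {0: 10, 4: 15, 8: 0, 12: 25, 16: 0, 20: 30}
table_8_4 = {0: 20, 4: 5, 8: 10, 12: 15, 16: 10, 20: 20}

# Periods starting at 0000, 0800, and 1600 involve (x20, x0), (x4, x8), and
# (x12, x16): no start time is shared, so these three requirements add up.
lower_bound = REQUIRED[0] + REQUIRED[8] + REQUIRED[16]

for name, schedule in (("Table 8.3", table_8_3), ("Table 8.4", table_8_4)):
    print(name, "feasible:", is_feasible(schedule),
          "total:", sum(schedule.values()),
          "largest shift:", max(schedule.values()))
print("lower bound on total force:", lower_bound)
```

Since both schedules reach the lower bound of 80, both are optimal. The largest shift in Table 8.4 is 20, which cannot be improved because the two shifts covering 0000-0400 must together supply 40 police.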


8.6 Project Scheduling 


Large infrastructure projects involving many personnel and machines and materials 
are commonly divided into a number of tasks. Each task needs to be completed 
before the entire project is completed. Of interest to project managers is when to 
begin each task and how to allocate the personnel, machines, and materials among 
tasks to minimize the total time and cost needed to complete the entire project 
(Fig. 8.8). 

A number of methods exist to estimate task start times. One is to create an opti- 
mization model that when solved will identify the task start times that minimize 
the total project time. Clearly, if each task had to be done one after another, the 
total project time would simply be the sum of all the task durations. In reality, 
some tasks can be worked on at the same time, or stated another way, what tasks 
need to be completed before others can begin depends on each particular task. The 
constraints of the model need to identify the sequencing of the tasks. 

To illustrate, assume a particular project consists of 6 distinct tasks. One task 
can begin right away (task A), and none of the other tasks can begin before some 
of the others are completed. These conditions, along with the expected duration, 
in weeks, of each task, are given in Table 8.5 below. The network diagram in the 
table also shows the necessary sequencing. 

Fig. 8.8 Deciding when to schedule various project tasks to complete an entire 
project. iStock licence number 2075982143 

To find the minimum number of weeks, T, required to complete the project and 
the corresponding starting times of each task, designated by A, B, C, D, E, and F, 
the following linear model can be solved: 


Minimize T 

Subject to: 
B ≥ A + 5 
C ≥ A + 5 
D ≥ B + 3 
D ≥ C + 6 
E ≥ C + 6 
F ≥ D + 7 
F ≥ E + 4 
T ≥ F + 2 


Its solution is shown in Table 8.6. 


Table 8.5 Sequence and duration of project tasks 

Project task i   Must follow   Duration, Di (weeks) 
A                -             5 
B                A             3 
C                A             6 
D                B, C          7 
E                C             4 
F                D, E          2 

Sequence network: A precedes B and C; B and C precede D; C precedes E; D and E 
precede F. 


Table 8.6 Solution of project planning problem showing start times of each task 
and total project time, T 

Variable   Start time   Reduced cost 
A          0            1 
B          5            0 
C          5            0 
D          11           0 
E          11           0 
F          18           0 

The minimum total project time, T, is 20. In any project such as this one, 
usually, only some of the tasks determine the total project time. In this example, 
this sequence of tasks is A, C, D, and F. Tasks B and E could start somewhat later if 
they do not alter the start times of the following tasks. This may be advantageous 
with respect to the management of personnel, material or machines. Of course, 
it may be advantageous to extend the total project time if cost savings result. 
However, extending total project completion times could result in penalties. 
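The start times in Table 8.6 can also be reproduced without an optimization solver. The sketch below uses a forward and a backward pass through the task network (the critical path method), which is a swapped-in technique rather than the linear model of the text. The dictionary names are illustrative, and the task data come from Table 8.5.

```python
DURATION = {"A": 5, "B": 3, "C": 6, "D": 7, "E": 4, "F": 2}  # weeks
MUST_FOLLOW = {
    "A": [], "B": ["A"], "C": ["A"],
    "D": ["B", "C"], "E": ["C"], "F": ["D", "E"],
}
ORDER = ["A", "B", "C", "D", "E", "F"]  # every task is listed after its predecessors


def earliest_starts():
    """Forward pass: a task starts when its last predecessor finishes."""
    start = {}
    for task in ORDER:
        start[task] = max(
            (start[p] + DURATION[p] for p in MUST_FOLLOW[task]), default=0
        )
    return start


def latest_starts(project_time):
    """Backward pass: the latest start that does not delay any follower."""
    latest = {}
    for task in reversed(ORDER):
        followers = [t for t in ORDER if task in MUST_FOLLOW[t]]
        finish_by = min((latest[f] for f in followers), default=project_time)
        latest[task] = finish_by - DURATION[task]
    return latest


early = earliest_starts()
T = max(early[t] + DURATION[t] for t in ORDER)
late = latest_starts(T)
slack = {t: late[t] - early[t] for t in ORDER}

print("earliest start times:", early)
print("total project time T:", T)
print("slack (weeks each task can slip):", slack)
```

Tasks with zero slack (A, C, D, and F) form the sequence that determines the project time, while B and E can each start up to 3 weeks later without delaying the project.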

Assume that a penalty of 2000 per week will apply for each week the project 
time is over 18. Now the question is whether this project time can be reduced and, 
if so, at what cost, and whether that cost will be less than the penalty. The 
objective becomes one of minimizing the total additional project cost of exceeding 
the target time of 18. Assume the cost of reducing the duration Di of task i by Δi 
is a known function, Ci(Δi), of that reduction. The objective of the model now is 
one of finding the task reductions, Δi, that minimize the sum of the task reduction 
costs, Ci(Δi), over all tasks i = A, ..., F, and the penalty cost, 2000 (T − 18). 


Minimize 2000 (T − 18) + Σi Ci(Δi),   summed over i = A, ..., F 
Subject to: 
B ≥ A + 5 − ΔA 
C ≥ A + 5 − ΔA 
D ≥ B + 3 − ΔB 
D ≥ C + 6 − ΔC 
E ≥ C + 6 − ΔC 
F ≥ D + 7 − ΔD 
F ≥ E + 4 − ΔE 
T ≥ F + 2 − ΔF 

This model assumes the total project time, T, will be no less than 18. If we were 
not sure that T would be at least 18, then we could add the constraints defining the 
positive difference, P, of T − 18: 


T − 18 ≤ P and P ≥ 0. 

The objective function would now be to 


Minimize 2000 P + Σi Ci(Δi),   summed over i = A, ..., F. 


This modification makes sure there is no negative penalty if T < 18. 


8.7 Trash and Pollution 


The management of trash is an issue facing every community. Assume a particular 
city burns a total of 3000 tons of trash per day in three incinerators. All three have 
antipollution devices. Their emissions differ, as shown in Table 8.7. At present, all 
three incinerators are operating at full capacity. The remainder of the city’s trash, 
another 1500 tons per day, is dumped into a sanitary landfill area. This landfill 
option is very expensive compared to incineration. The city is under court order to 
reduce the total emissions of sulfur dioxide to 400,000 units per day and particulate 
emissions to 50,000 per day. These maximum allowable emissions are less than 
what is being discharged at the present time. The city wants to know the most 
economical way to meet these standards (Fig. 8.9 and Table 8.7). 


Fig. 8.9 Burning trash at an incineration plant. Credit Pixabay/CC0 public domain. 
https://phys.org/news/2021-11-life-carbon-capture.html 


Table 8.7 Capacity and emission data pertaining to the incineration of trash 

Incinerator   Capacity (tons/day)   Emissions per ton burned 
                                    SO2     Particulates 
A             1200                  250     20 
B             800                   150     30 
C             1000                  220     24 

Let the variables A, B, C be the tons of trash burned per day in incinerators A, 
B, and C, respectively. The city’s objective is to burn as much trash as possible 
while meeting the emission and capacity constraints. 

This is an optimization problem that can be written as 


Maximize A + B + C                    amount of trash burned per day 

Subject to: 

250A + 150B + 220C ≤ 400,000          maximum sulfur dioxide emission 
20A + 30B + 24C ≤ 50,000              maximum particulate emission 
A ≤ 1200                              maximum capacity of incinerator A 
B ≤ 800                               maximum capacity of incinerator B 
C ≤ 1000                              maximum capacity of incinerator C 


The solution of this model is given in Table 8.8. 

The solution shows that all three incinerators should be used, but only B at 
capacity. The dual variables of the emission constraints indicate the additional 
tons of trash that could be burned per unit increase in the emission standards. For 
example, if 100 more units of SO2 could be released, then 0.25 more tons of trash 
could be burned. Depending on the cost savings that would result from reducing 
the amount of trash taken to the landfill, the city might wish to argue for less 
strict standards. Alternatively, it might offer to further reduce its emissions in the 
interest of improving the public’s health or reducing the adverse impacts of climate 
change. 


Table 8.8 Solution to incinerator problem 


Objective value: 1987.5 tons of trash can be burned per day 


Variable       Value              Reduced cost 
A              625.0000           0.000000 
B              800.0000           0.000000 
C              562.5000           0.000000 

Constraint     Slack or surplus   Dual variable 
SO2            0.000000           0.2500000E-02 
Particulate    0.000000           0.1875000E-01 
Capacity A     575.0000           0.000000 
Capacity B     0.000000           0.6250000E-01 
Capacity C     437.5000           0.000000 
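A reported solution and its dual variables can be cross-checked. For a linear model, the sum over all constraints of each limit multiplied by its dual variable should equal the optimal objective value. The sketch below applies this check to the incinerator results using exact fractions. The variable names are illustrative, and only the data of Table 8.7 and the reported values are assumed.

```python
from fractions import Fraction

burn = {"A": Fraction(625), "B": Fraction(800), "C": Fraction("562.5")}
so2_per_ton = {"A": 250, "B": 150, "C": 220}
particulate_per_ton = {"A": 20, "B": 30, "C": 24}
capacity = {"A": 1200, "B": 800, "C": 1000}
SO2_LIMIT, PARTICULATE_LIMIT = 400_000, 50_000

# Reported dual variable values (tons of trash per unit of each limit)
dual_so2 = Fraction("0.0025")
dual_particulate = Fraction("0.01875")
dual_capacity = {"A": Fraction(0), "B": Fraction("0.0625"), "C": Fraction(0)}

total_burned = sum(burn.values())
so2 = sum(so2_per_ton[k] * burn[k] for k in burn)
particulate = sum(particulate_per_ton[k] * burn[k] for k in burn)
unused_capacity = {k: capacity[k] - burn[k] for k in burn}

# Each constraint limit weighted by its dual variable, summed
dual_total = (
    SO2_LIMIT * dual_so2
    + PARTICULATE_LIMIT * dual_particulate
    + sum(capacity[k] * dual_capacity[k] for k in capacity)
)

print("trash burned:", float(total_burned))
print("SO2 and particulate emitted:", float(so2), float(particulate))
print("unused capacity:", {k: float(v) for k, v in unused_capacity.items()})
print("limits weighted by duals:", float(dual_total))
print("extra tons for 100 more units of SO2:", float(100 * dual_so2))
```

Both emission limits are met exactly, incinerator A has 1200 − 625 = 575 tons per day of unused capacity, and the weighted limits add up to the 1987.5 tons burned.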
8.8 Modeling Fixed Cost Problems 


Minimizing costs is a common objective, among others, in optimization models. 
Many cost functions include fixed costs, as illustrated in Fig. 8.10. In addition, 
some decision problems involve finding optimal integer variable values instead 
of continuous values. For example, allocating fractions of trucks or workers to 
various construction sites in a community makes no sense. Non-negative integer 
variables can take on values 0, 1, 2, etc. Variables having only integer values must 
be specified as such as part of the input to the computer program, such as Solver 
in Excel, used to solve the model. Binary integer variables that can take on only 
values of 0 or 1 must also be designated as such in computer programs used to 
solve any model containing them. Non-negative integer variables constrained to be 
no greater than 1 can also represent binary (0, 1) variables. 

Consider cost functions such as those sketched in Fig. 8.11, in which the 
variable costs are linear with slopes Ci and the fixed costs are C0i. 

Each cost function i equals 


Costi = C0i + CiX   if X > 0, 
      = 0           otherwise. 


The fixed costs, C0i, only apply if the variable X is greater than 0. If X = 0, 
then Costi = 0. 

The use of binary variables makes it possible to include such cost functions in 
linear optimization models. For example, suppose one wants to find the minimum 
cost associated with a value of the variable X in Fig. 8.11. The answer is obvious 
just from looking at the two cost functions and picking the one having a lower 
value. If the value of X is to the left of the breakeven point where the two cost 
functions meet and have the same value, clearly Cost2, having the lower fixed cost, 
is cheaper. Otherwise, if the value of X is to the right of the breakeven point, 
Cost1, having the higher fixed cost, is cheaper. 


Fig. 8.10 A cost function showing economies of scale and fixed initial costs that 
apply if X > 0. Total Cost = Fixed Cost + Variable Cost, plotted as cost ($) against 
quantity X 


Fig. 8.11 Two cost functions having fixed costs C0i and linear variable costs whose 
marginal values (slopes) are Ci 

If we entered either cost function into a computer to have it identify the cost 
associated with any given value of X, it would give us the correct answer unless 
the value of X was 0. In that case, it would give us the fixed cost. Hence, we 
need some way to let the computer know that if X = 0, the total cost is 0. That 
constraint needs to be included in the model, and ideally, that constraint should be 
linear. 

One approach for doing this is to multiply the known fixed cost by an unknown 
binary variable. Let Z be that binary variable. Considering just one cost function, 
the objective becomes 


Minimize Cost = C0 Z + C X, 


for any value of X and where Z can be either 0 or 1. 

When the binary variable Z is 1, the fixed cost, C0, is included in the total 
cost. When the value of Z is 0, it is not included in the cost. Hence, if the cost is 
to be minimized and nothing else constrains Z, the value of Z will be 0 no matter 
what the value of X is. The challenge is to create a linear constraint that will 
force the binary variable Z to equal 1 when X is strictly greater than 0. Otherwise, 
since the cost function is to be minimized, Z will take the value 0. 

If we require 


X ≤ 999 Z 


then if the value of X is greater than 0, the binary variable Z must equal 1. This 
constraint also defines the upper bound on the value of X, which in this example 
is 999. If X has no upper bound, then any number large enough to exceed every 
value X could assume can be used in its place. 
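The effect of the linking constraint can be seen by trying both values of Z for a given X. The sketch below is a toy illustration: the fixed cost of 50 and the unit cost of 3 are hypothetical numbers, and the function names are made up. It keeps only the values of Z that satisfy X ≤ 999 Z and returns the cheaper one.

```python
def total_cost(x, z, fixed=50.0, unit=3.0):
    """Cost = C0*Z + C*X, with hypothetical C0 = 50 and C = 3."""
    return fixed * z + unit * x


def cheapest(x, big_m=999):
    """Choose the binary Z that minimizes cost subject to X <= big_m * Z."""
    feasible = [z for z in (0, 1) if x <= big_m * z]
    z = min(feasible, key=lambda candidate: total_cost(x, candidate))
    return z, total_cost(x, z)


print(cheapest(0))    # Z = 0 is allowed and cheaper, so no fixed cost
print(cheapest(10))   # X > 0 rules out Z = 0, so the fixed cost applies
```

Values of X above 999 leave no feasible Z in this sketch (it would raise an error), which shows why the multiplier must be at least as large as any value X might take.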

This trivial example can be made more interesting by assuming the two cost 
functions shown in Fig. 8.11 represent the cost of buying and operating two cars 
that are for sale. For each car j, the fixed cost, C0j, is the annual value of the 
purchase price, and the variable cost, CjXj, is the annual operating cost of driving 
it Xj miles. Car 1 is more expensive to buy but cheaper to operate. Car 2 is less 
expensive to purchase but more expensive to operate. Whichever car is selected, 
it will be driven over a three-year period in which the predicted miles the car will 
be driven each year will differ. The question is which car will result in a lower 
present value of the total cost. 

If there were no difference in fixed costs, it is obvious the car with the smaller 
variable operating cost (slope Cj) would be the less expensive car to buy. But a 
difference in fixed costs makes a difference. If the predicted miles driven include 
some that are less than the breakeven point and others in other years that are 
greater than the breakeven point, which car to buy may not be so obvious. 

To model this problem, let My be the predicted non-zero number of miles that 
are expected to be driven in year y. If there were only one car to consider, then 
the present value of the total cost over the three years is 


Cost = C0 + C M1/(1 + i) + C M2/(1 + i)^2 + C M3/(1 + i)^3 − R/(1 + i)^3 


where i is the annual interest rate and R is the resale value at the end of 3 years. 
One could plug in the values of C0, C, and R for each car and compare the results 
to determine which car would be less expensive. It would also make sense to 
vary the assumed values of these parameters, along with the My estimates, to see 
how sensitive the decision of which car to buy is to those assumed, but uncertain, 
values. 
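The single-car comparison is easy to carry out directly. The sketch below implements the present value expression above. All the numbers for the two cars, the mileage forecast, and the interest rate are hypothetical and chosen only for illustration.

```python
def present_value_cost(fixed, unit, resale, miles, rate):
    """Fixed cost, plus discounted yearly operating costs, minus the
    discounted resale value received at the end of the last year."""
    pv = fixed
    for year, miles_in_year in enumerate(miles, start=1):
        pv += unit * miles_in_year / (1 + rate) ** year
    pv -= resale / (1 + rate) ** len(miles)
    return pv


# Hypothetical data: car 1 costs more to buy but less to operate
miles = [8000, 20000, 12000]   # predicted miles M1, M2, M3
rate = 0.05                    # annual interest rate i
car1 = present_value_cost(fixed=30000, unit=0.20, resale=15000,
                          miles=miles, rate=rate)
car2 = present_value_cost(fixed=22000, unit=0.45, resale=9000,
                          miles=miles, rate=rate)

print("car 1:", round(car1, 2))
print("car 2:", round(car2, 2))
print("less expensive:", "car 1" if car1 < car2 else "car 2")
```

Repeating the calculation with other mileage forecasts or interest rates is one way to carry out the sensitivity analysis suggested above.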

Alternatively, one could include both cars in the same model and have its 
solution indicate which car to buy. This requires allowing the use of both cars. 
Let the variable Xjy be the miles driven using car type j in year y. Their sum over 
both car types in each year y must be at least My. Let Zj be a binary variable 
associated with car type j. Now the objective of minimizing the present value of 
the total costs can be written as 


Minimize Cost = Σj [ C0j Zj + Σy { Cj Xjy/(1 + i)^y } − Rj Zj/(1 + i)^3 ] 

Subject to: 

Σy Xjy ≤ (Σy My or more) Zj   ∀j   forcing Zj to be 1 if car type j is driven, 
                                   i.e., any Xjy > 0. 

Σj Xjy ≥ My                   ∀y   mileage requirement in each year y. 

Zj is binary                  ∀j   associated with fixed cost and resale parameters. 


The solution of this mixed-integer linear model, once the values of the fixed 
and unit variable costs, the interest rate, the miles to be driven in each year, and 
the resale value are specified, will show that only one of the binary variable 
values, Zj, is 1. The less expensive car j is the one whose Zj = 1. The constraint 
Z1 + Z2 ≤ 1, ensuring that only one car, if any, is used, can be added to the 
model’s constraints, but it is not necessary. Selecting both cars, while feasible, 
would clearly increase the total cost. Just how sensitive the choice of car is to 
the mileage requirements will be indicated by the applicable ranges of the dual 
variable values of the second set of constraints. 


Fig. 8.12 Happiness is assuming the world is linear! 

There are many more ways of using integer and binary variables in models. 
Chapter 9 contains more information on how various non-linear terms and func- 
tions can be approximated by linear ones using these integer and binary variables. 
Again, the motivation for doing this is evident when trying to solve large non- 
linear optimization problems. At the same time, one should minimize the use of 
integer variables to the extent possible, for they too can challenge some com- 
puter programs designed to solve mixed-integer models containing both continuous 
and integer variables. Rounding continuous variable values to their nearest integer 
values does not always guarantee optimal or even feasible solutions (Fig. 8.12). 


Exercises 


1. Bake Sale 
For a community fundraising event cakes and pies are to be sold. Find how many 
cakes and pies should be baked to maximize total income. 
Let A be the number of cakes and B the number of pies produced. The 
following data apply: 


Product A B 


Income per item $6 $8 


Pans required per item 


Labor required per item 2 4 


There are 80 pans and 280 person hours available, and because of limited cake 
ingredients, no more than 50 cakes (A) can be produced. 


2. Diet model 
You manage the local SPCA (Society for the Prevention of Cruelty to Animals) 
that keeps stray dogs. Your dogs need to eat and there are two varieties of dog 
food available: foods D and C. Their unit costs are $1.10 and $0.90, respectively. 
Your job is to find the least cost combination of pounds of D and C for each dog 
that meets various nutrition constraints shown in the table below. The amounts of 
the ingredients shown are in each pound of D and C. 


Ingredient      D        C        Daily minimum per dog 
Protein         3 oz     4 oz     8 oz 
Carbohydrate    5 oz     12 oz    11 oz 
Iron            30 mg    35 mg    100 mg 

(a) First, describe your objective function and constraints in words. 
(b) Define the parameters and variables, and their units, that you can use to create a 
    mathematical model. 
(c) Express the model mathematically. 
(d) Show the solution by plotting the constraints and objective function on a graph 
    of D versus C. 


3. Labor Scheduling 
A social welfare program involves three projects. Projects A, B, and C require 18, 
12, and 30 person months to complete. Four qualified social workers are available 
to work on these projects. 

Their monthly salaries are $3000, $3500, $3200, and $3900, respectively. 

All projects must be completed in 18 months, and each social worker can be 
assigned only to one project in each 6-month period. Multiple workers can be 
assigned to the same project. 

Find the allocation of each worker to each job that minimizes the total cost of 
completing the projects. 


4. A transportation problem 


Assume there are 4 warehouses containing supplies of personal protective 
equipment, commonly referred to as ‘PPE,’ that are used at 6 hospitals. Given the supplies 
available at each warehouse and the demand at each hospital, and the unit costs of 
transporting them (all known values), construct a model to determine how much 
gets transported from each warehouse to each hospital that minimizes the total 
transportation costs. 

To do this, you need to make up your notation for all variables and parameters. 
Plug in values of the parameters of the model and solve it to find how much is 
shipped from each warehouse to each hospital. 

What condition must be satisfied for your model to be feasible? 


5. Forest management 


A particular State Forest has four different subareas whose characteristics such as 
species composition, age distribution, drainage, soil characteristics, etc., are 
similar. The areas of these subareas are known. Recent growth studies have produced 
predictions of the volumes per hectare for each subarea for the next 50 years. The 
forest manager is responsible for defining a cutting schedule that will produce a 
steady supply of logs to be cut into lumber over the 50-year life span of the forest. 
Her goal is to find the maximum constant amount of wood (volume) that can be 
converted to lumber every year. 

Develop a model for determining just how much volume can be cut in each 
subarea in each of five 10-year periods. Once any area is cut, trees in that area 
cannot be cut again for another 50 years. Cutting trees from the forest in this 
sustainable way increases water yields, the quality of wildlife habitat, and timber 
income. 

Define the variables, parameters, and constraints you need, and use them to 
build and solve a model for identifying the best cutting schedule—i.e., how much 
to cut, where, and when. 


6. Water Quality Management Model 
Find the wastewater treatment efficiencies at sites 1 and 2 that meet stream quality 
standards at sites 2 and 3 at a minimum total cost. Currently, there is no treatment. 
All the wastewaters at sites 1 and 2 are discharged into the stream. 


Figure (stream schematic): Site 1, Site 2, and Site 3, with wastewater discharges 
labeled 100 kg/day and 200 kg/day. 

Current Pollutant Concentrations: 58 mg/l, 95 mg/l 
Maximum Allowable Concentrations: 18 mg/l, 23 mg/l 


Available Data: 

Streamflow = 1000 m³/day at all sites. 1 kg/day/1000 m³/day = 1 mg/l; 

Fraction of waste discharged into the stream at site 1 that reaches site 2: 0.25 

Fraction of waste discharged at site 1 that reaches site 3: 0.15 

Fraction of waste at and discharged into the stream at site 2 that reaches site 
3: 0.60 

Limits of treatment: removal of 30 % required, but no more than 90%, for both 
sites. The initial concentration just upstream of site 1 is 32 mg/l. 

Can you find the least cost solution that meets the quality standards without 
knowing the cost functions for treatment? 
Some Linearization Methods 


ABSTRACT 


Because linear programming algorithms are so efficient and in widespread use, 
and because non-linear optimization solvers have limitations when applied to large 
models, modelers wanting to solve very large models often attempt to 
linearize the non-linear terms in their models. This chapter introduces various 
approaches for accomplishing this, often using binary (0, 1) variables. 


This chapter reviews some methods and approaches for incorporating non-linear 
and other conditions into linear programming models. The motivation, of course, 
is to take advantage of the power of linear programming algorithms in solving 
linear as opposed to non-linear models. 


9.1 If-Then-Else Conditions 


There exist a number of ways if-then-else conditions, i.e., decision trees, can be 
included in linear programming models. To illustrate some of them, assume that 
X is an unknown decision variable in a model whose value depends on the value 
of another unknown decision variable Y. Assume a maximum value that Y would 
not exceed. Let this upper bound be Uy. Similarly, assume a maximum value 
that X would not exceed, Ux. These upper bounds and all the linear constraints 
defining ‘if-then-else’ conditions must not restrict the values of the original 
decision variables X and Y. Four ‘if-then-else’ and ‘and/or’ conditions are presented 
below using additional binary variables and, in the last three examples, 
continuous variables. All the X, Y, and Z variables in the constraints below are assumed 
to be unknown. Greek letters are known parameters whose values are less than 
the upper bounds on the variables. These linear constraints would be included in 
models where these if-then-else or and/or conditions apply. 
(a) If Y ≤ α then X ≤ β, else X ≥ γ. 
Define constraints: 


Y ≤ αZ + Uy(1 − Z)   where Z is a 0, 1 integer variable. 
Y ≥ α(1 − Z) 
X ≤ βZ + Ux(1 − Z) 
X ≥ γ(1 − Z) 
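One way to gain confidence in a set of constraints like those in (a) is to test them against the logical rule they are meant to represent. The sketch below assumes the four constraints of (a) read Y ≤ αZ + Uy(1 − Z), Y ≥ α(1 − Z), X ≤ βZ + Ux(1 − Z), and X ≥ γ(1 − Z). The parameter values and function names are hypothetical.

```python
ALPHA, BETA, GAMMA = 4.0, 3.0, 6.0   # hypothetical known parameters
U_Y, U_X = 10.0, 10.0                # hypothetical upper bounds on Y and X


def satisfies(y, x, z):
    """The four linear constraints of condition (a) for a given binary z."""
    return (
        y <= ALPHA * z + U_Y * (1 - z)
        and y >= ALPHA * (1 - z)
        and x <= BETA * z + U_X * (1 - z)
        and x >= GAMMA * (1 - z)
    )


def allowed(y, x):
    """A pair (Y, X) is feasible if some value of the binary Z works."""
    return any(satisfies(y, x, z) for z in (0, 1))


def rule(y, x):
    """The if-then-else condition stated in words."""
    return x <= BETA if y <= ALPHA else x >= GAMMA


mismatches = [
    (y, x)
    for y in range(0, 11)
    for x in range(0, 11)
    if allowed(y, x) != rule(y, x)
]
print("pairs where constraints and rule disagree:", mismatches)
```

In this test the only disagreements occur exactly at Y = α, where the linear constraints accept either branch. That is typical of formulations built from non-strict inequalities.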


(b) If Y ≤ α then X ≤ Y, else X ≥ Y. 
Define constraints: 


Y=Y,+aZ, 

Y2 < Uy —a)Zp 

Z| + Z2 < 1 where each Z is a0, 1 integer variable. 
Xı <aZ, 

X<Y 

X2 < Ux% 


(c) If Y ≤ α or Y ≥ β then X = γ, else X ≥ δ. 
Define constraints: 


Y ≤ αZ1 + βZ2 + Uy(1 − Z1 − Z2) 
Y ≥ αZ2 + β(1 − Z1 − Z2) 
Z1 + Z2 ≤ 1   where each Z is a 0, 1 integer variable. 


X1 = γZ1 
X2 = γ(1 − Z1 − Z2) 
X3 ≥ δZ2 


X = X1 + X2 + X3 


(d) If α ≤ Y ≤ β but (and) not γ ≤ Y ≤ δ, where γ > α and δ < β, then X ≤ ε, else 
X ≥ φ. 
Define constraints: 
Y ≤ αZ1 + γZ2 + δZ3 + βZ4 + Uy(1 − Z1 − Z2 − Z3 − Z4) 
Y ≥ αZ2 + γZ3 + δZ4 + β(1 − Z1 − Z2 − Z3 − Z4) 
Z1 + Z2 + Z3 + Z4 ≤ 1   where each Z is a 0, 1 integer variable. 
X1 ≤ εZ2 
X2 ≤ εZ4 
X3 ≥ φ(1 − Z2 − Z4) 
X3 ≤ Ux(1 − Z2 − Z4) 
X = X1 + X2 + X3 
9.2 Fixed Costs in Cost Functions 


The cost function Cost = C0 + C X if X > 0, and 0 otherwise, includes a 
fixed cost C0 (Fig. 9.1). 

To include such cost functions in a linear optimization model, define Cost = 
C0 Z + C X and constrain X ≤ M Z, where M is the upper bound of X, and Z is 
an unknown 0, 1 variable. 

Fig. 9.1 A fixed cost function having linear variable costs 


9.3 Minimizing the Maximum or Maximizing the Minimum of a Set of Unknown Variables or Functions 


Let the set of variables be {X1, X2, X3, ..., Xn}. 

Minimize maximum {X1, X2, X3, ..., Xn} is equivalent to: 
Minimize U subject to U ≥ Xj, j = 1, 2, 3, ..., n. 

Maximize minimum {X1, X2, X3, ..., Xn} is equivalent to: 
Maximize L subject to L ≤ Xj, j = 1, 2, 3, ..., n. 

The same applies to a set of functions fj(X) of unknown decision variables 
contained in the vector X. 


9.4 Minimizing the Absolute Value of the Difference Between Two Unknown Non-negative Variables 


Minimize |X − Y| is equivalent to 


Minimize D 
subject to X − Y ≤ D; Y − X ≤ D; X, Y, D ≥ 0. 

or 


Minimize (PD + ND) 
subject to X − Y = PD − ND; PD, ND, X, Y ≥ 0. 


9.5 Minimizing Convex Functions or Maximizing Concave 
Functions 


See Figs. 9.2, 9.3 and 9.4. 


Maximize G(X) = Maximize B 
Subject to: I1 + S1X ≥ B 
            I2 + S2X ≥ B 
            I3 + S3X ≥ B 


Fig. 9.3 Piecewise linear approximations of convex (F(X)) and concave (G(X)) functions. 
Segments 1, 2, and 3 have slopes S1, S2, S3 and segment variables x1, x2, x3, with 
breakpoints at X = a and X = b 


Minimize F(X) = S1x1 + S2x2 + S3x3; Maximize G(X) = S1x1 + S2x2 + S3x3 

X = x1 + x2 + x3; x1 ≤ a; x2 ≤ b − a. 


Fig. 9.4 Piecewise linear approximations of convex (F(X)) and concave (G(X)) functions. 
Unknown weights are assigned to each segment endpoint 


Using unknown weights: 
Minimize: F(X) = F(0)w1 + F(a)w2 + F(b)w3 + F(c)w4 

Maximize: G(X) = G(0)w1 + G(a)w2 + G(b)w3 + G(c)w4 
X = 0w1 + aw2 + bw3 + cw4; w1 + w2 + w3 + w4 = 1 


9.6 Minimizing Concave Functions or Maximizing Convex 
Functions 


See Fig. 9.5. 


Minimize G(X) = 5x1 + (20z2 + 3x2) + (44z3 + 2x3) 
Subject to: 
x1 + (4z2 + x2) + (12z3 + x3) = X; zs = 0 or 1 for s = 1, 2, 3. 
x1 ≤ 4z1; x2 ≤ 8z2; x3 ≤ 99z3; z1 + z2 + z3 = 1. 


Fig. 9.5 Concave function G(X) 
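
To see how the 0,1 variables select exactly one segment, the model of Fig. 9.5 can 
be evaluated by simple enumeration. The following is only a sketch, assuming the 
segment data are read from the coefficients of the model above (segment starts at 
X = 0, 4, 12; costs at those starts of 0, 20, 44; slopes 5, 3, 2); the names are 
hypothetical and no optimization solver is used. 

```python
# (start of segment, cost at start, slope, bound on the segment variable)
SEGMENTS = [
    (0, 0, 5, 4),     # x1 <= 4*z1
    (4, 20, 3, 8),    # x2 <= 8*z2
    (12, 44, 2, 99),  # x3 <= 99*z3
]


def concave_cost(total_x):
    """Smallest G(X) over the choices allowed by z1 + z2 + z3 = 1."""
    candidates = []
    for start, cost_at_start, slope, max_len in SEGMENTS:
        x_s = total_x - start            # amount placed on this segment
        if 0 <= x_s <= max_len:          # feasible only if it fits the segment
            candidates.append(cost_at_start + slope * x_s)
    if not candidates:
        raise ValueError("X is outside the range covered by the segments")
    return min(candidates)


for x in (2, 4, 12, 20):
    print(x, concave_cost(x))
```

At the breakpoints X = 4 and X = 12, two segments are feasible and give the same 
cost, which reflects the continuity of G(X). 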


9.7 Minimizing or Maximizing Combined Concave-Convex 
Functions 


See Figs. 9.6, 9.7 and 9.8. 


Maximize C(X) = (5z1 + 6x1 + 3x2) + (53z3 + 5x3) 
Subject to: 
(x1 + x2) + (12z3 + x3) = X; 
x1 ≤ 4z1; x2 ≤ 8z1; x3 ≤ 99z3; z1 + z3 = 1; z1, z3 = 0, 1. 


Maximize C(X) = (5z1 + 6x1) + (29z2 + 3x2 + 5x3) 
Subject to: 
x1 + (4z2 + x2 + x3) = X; 
x1 ≤ 4z1; x2 ≤ 8z2; x3 ≤ 99z2; z1 + z2 ≤ 1; z1, z2 = 0, 1. 


Fig. 9.6 Mixed concave and convex function C(X) 


Maximize or Minimize F(X) 
F(X) = (5z1 + 6x1) + (35z2 + 3x2) + (32z3 − 2x3) + 22z4 
Subject to: 
x1 + (4z2 + x2) + (12z3 + x3) + (17z4 + x4) = X; 
x1 ≤ 4z1; x2 ≤ 8z2; x3 ≤ 5z3; x4 ≤ 99z4; Σs zs = 1; zs = 0, 1 for all s. 


Maximize C(X) = (5z1 + 6x1 + 3x2) + (−17z3 + 5x3) 
Subject to: 
(x1 + x2) + x3 = X; z1, z3 = 0, 1. 
x1 ≤ 4z1; x2 ≤ 8z1; x3 ≤ 99z3; z1 + z3 = 1. 


Fig.9.7 Discontinuous 
piecewise linear function 


Fig.9.8 Mixed 
concave—convex piecewise 
linear function 
Maximize C(X) = (5z1 + 6x1) + (17z2 + 3x2 + 5x3) 
Subject to: 
x1 + (4z2 + x2 + x3) = X; z1, z2 = 0, 1. 
x1 ≤ 4z1; x2 ≤ 12z1; x3 ≤ 99z2; z1 + z2 ≤ 1. 

1. Groundwater pumping: 
This is an exercise in the use of fixed costs and piecewise linear variable costs. 
(a) Show how you would include the following cost functions, C(S), as shown in 
the figure, in a linear optimization model. 

Fixed = 0, variable = 10, 
Fixed = 0, variable = 5, 
Fixed = 0, variable = 8 to S = 5, then 15. 
Fixed = 20, variable = 5, 
Fixed = 14, variable = 4 to S = 6, then 12, 
Fixed = 20, variable = 5 to S = 7, then 3. 

[Figure: cost functions C(S); vertical axis Cost, horizontal axis Supply S] 


(b) Develop models for finding the minimum cost to meet a demand from 
two sources of groundwater using pairs of cost functions given above and 
assuming known maximum flow capacities at each well field. 

Assume: 


Qa = flow from source A, unknown (m³/day), 

Qb = flow from source B, unknown (m³/day), 

Ca(Qa) = cost function, as above ($), 

Cb(Qb) = cost function, as above ($), 

Demand = flow required to be met (m³/day), 

Ka, Kb = maximum flow capacity of well fields A and B, respectively 
(m³/day). 


(c) Now consider increasing demands for flow over time. Develop a model that 
finds the minimum cost pumping schedule over time. Just assume Ca() and 
Cb() as the cost functions for adding additional flow capacity in any period t. 


2. Capacity expansion problem 
To meet the growing demand for public housing, a community has decided to 
build more housing units. There are two sites where this can be done, and the 
question is which site is less expensive over time. Assume these sites are named 
A and B. Let A(t) and B(t) be the capacity of each of those sites at the beginning of 
period t. Let KA(t) and KB(t) be the added capacity in period t, costing Ca(KA(t)) 
and Cb(KB(t)). Construction periods last 5 years; hence each period t will be a 
5-year period. Costs must be paid at the beginning of each period. 
Cost functions: 


Ca(KA(t)) = 15 + 8 KA(t) if KA(t) > 0; otherwise = 0. 
Cb(KB(t)) = 5+ 9 KB(t) if KB(t) > 0; otherwise = 0. 

Assume these apply in each period t. 

r = annual interest rate. Discount factor: 1/(1 + r)^(5(t − 1)) 

Projections of future demands for public housing have been made. Estimates 
of total capacity requirement are: 

End of period 1     5 
End of period 2    10 
End of period 3    18 
End of period 4    33 


Solve using linear programming, and show the sensitivity of the solution to 
the value of the annual interest rate r. 


3. There are two users of resources, A and B, whose income depends on the resources 
they are allocated. Let those allocations be A and B, respectively. The income to 
user A equals 10A − 0.5A^2. The income to user B is 5B − 0.25B^2. 

(a) What are the allocations that result in the maximum total income? 

(b) If you have only 14 resources to allocate, show how you could get an 
approximate solution using linear programming. 

(c) Show how the model could be modified to obtain the maximum equal income 
for both users. 
10 Solving Models Using Calculus 


ABSTRACT 


Solutions to many economic models are based on marginal values of functions, 
such as marginal costs, marginal benefits, and marginal net benefits, or whatever 
the function being maximized or minimized represents. These marginal values 
are the slopes of functions. This chapter introduces how to use differential 
calculus to find the slopes and solutions to problems characterized by continuous 
non-linear functions. The reverse, called integral calculus, is also introduced 
for finding areas under functions that can represent total costs, benefits, and/or 
other values such as probabilities that are discussed in later chapters. 


10.1 Introduction 


Many optimal solutions of models having continuous non-linear objective functions 
are based on the slopes of those functions rather than the functions themselves. 
Slopes are the change in the function value per change in the value of the 
function’s argument. If the function is f(x), its slope is Δ(f(x))/Δx. A maximum 
or minimum of a smooth function, away from the end points of its range, occurs 
where its slope is 0. The hill climbing approach used in Chap. 4 to solve a 
discrete version of the resource allocation problem involved finding the steepest 
slope of multiple user benefit functions and making an allocation to the user 
having the steepest remaining slope. The benefit-cost example introduced in Chap. 6 
involved finding the allocation where the slopes of the benefit and cost functions 
were equal. In fact, the optimal solutions to both problems occurred when the 
slopes of the objective functions were equal. Slopes play a significant role in 
economic decision-making. Economists call these slopes marginal values, such as 
marginal benefits or marginal costs or marginal yields. This chapter introduces 
ways of finding slopes of continuous functions and how they can help us address 
various policy issues. To do this, we can use some procedures included in what is 
termed differential calculus. 
Fig. 10.1 Founders of the ‘mathematics of change,’ Gottfried Wilhelm Leibniz and Isaac 
Newton. (Image: Christoph Bernhard Francke/Public domain, Image: Dr Project/Shutterstock, 
Image: After Godfrey Kneller/Public domain) 
https://www.thegreatcoursesdaily.com/invented-calculus-newton-leibniz/ 


This chapter assumes that many using this book may not have had much if any 
calculus and hence this basic introduction may be helpful. If you already know 
this subject, you can probably skip this chapter and go on to others (Fig. 10.1). 


10.2 Finding Slopes 


Differentiation is a method of calculus that lets us find the slope at any point on 
a function. If we are interested in finding the maximum value of the non-linear 
function f(x) shown in Fig. 10.2, we know that happens when the slope is 0. We can 
therefore use differentiation to find the function that is the slope of the 
original function, f(x), set that slope function equal to 0, and solve for x. The 
value of x where the slope of f(x) = 0 is where the black dot is in Fig. 10.2. 

Slopes define the rate of change of a function f(x) at any point on the func- 
tion, i.e., at any value of x. As the function’s variable value, i.e., x, changes, the 
function’s slope may also change, such as is the case in Fig. 10.2. 


Fig. 10.2 A concave function f(x) whose maximum value is indicated by the dot. At 
this point, the slope is 0 

An easy way to find slopes for any continuous function is by differentiation. 
Differentiating a function results in another function whose value for any value x is 
the slope of the original function f(x) at x. This function is known as the derivative 
of the original function and is denoted by either a prime sign, as in f’(x), or by the 
differential operator notation, df/dx. The operator ‘d’ replaces the change notation 
‘Δ’ as in Δf(x)/Δx and signifies the value this ratio approaches as Δx goes to 0. 

The slope of any continuous function f(x) at any value of x is the slope of the 
line tangent to it at that value of x, such as shown in Fig. 10.2. 

If the function is concave, as shown in Fig. 10.2, its slope decreases as x 
increases. The slope of a convex function increases as x increases. The slope, 
also called the gradient, of a function tells us how steep the function f(x) is at a 
particular value of x. A horizontal line has slope 0; a line with a positive slope 
increases in value as x increases, and a line with a negative slope decreases in 
value as x increases. 


10.3 Maxima and Minima 


Finding the value of x at which the slope of f(x) is 0 does not always guarantee a 
maximum or minimum of the function. The function may have multiple values of x 
that result in slopes of 0. For now, this is just a warning: without some 
additional tests, the value(s) of x where the slope of f(x) is 0 do not tell us 
whether each solution is a maximum or a minimum, or whether it is a global rather 
than a local maximum or minimum (Fig. 10.3). 

One way to know if a point on a function where the slope is 0 is a true maximum 
or minimum is to just graph the function and see if it looks like a global maximum 
or minimum. You can also find the slope of the function for a slightly smaller 
value of x to determine if the function was at a maximum or a minimum value. If 
the newly computed slope is positive, the zero-slope value of the function was a 
maximum. Otherwise, the function was at a minimum value. 
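
The slope test just described can be sketched numerically. This is only an 
illustration, assuming the slope is estimated by a small finite difference rather 
than by differentiation; the function names and step sizes are hypothetical, and 
the test is local, so it says nothing about whether the point is a global maximum 
or minimum. 

```python
def slope(f, x, dx=1e-6):
    """Central-difference estimate of the slope of f at x."""
    return (f(x + dx) - f(x - dx)) / (2 * dx)


def classify_zero_slope_point(f, x0, step=1e-3):
    """Classify x0 from the sign of the slope at a slightly smaller x."""
    left_slope = slope(f, x0 - step)
    if left_slope > 0:
        return "maximum"        # function was still rising just before x0
    if left_slope < 0:
        return "minimum"        # function was still falling just before x0
    return "undetermined"


print(classify_zero_slope_point(lambda x: 10 * x - x ** 2, 5.0))
print(classify_zero_slope_point(lambda x: x ** 2, 0.0))
```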


Fig. 10.3 A graph of a function having local maximum and minimum values. At those 
points, the slopes of the function are 0. The global maximum (y = 10) and minimum 
(y = −10) points are at the end points of the function where the slopes are not zero 


10.4 Finding Slopes Using Differentiation 


A derivative of a function defines its slope. The derivative of a function is another 
function that is the slope of the original function. For example, consider the 
function 5x^2. Its slope at any value of x is found by differentiating it, i.e., by 
finding d(5x^2)/dx. Most of the functions we will be working with in this book are 
power functions having terms of the form ax^b, where ‘a’ and ‘b’ are known constants. 

Consider the function f(x) = ax^b. The slope of this power function is found in 
two steps: 


(1) Multiply the term by its exponent b, so ax^b becomes bax^b 
(2) Subtract 1 from the exponent, resulting in bax^(b−1). 


This is the slope of ax^b for any value of x. Differentiation is as simple as that for 
continuous power functions. Even constants can be expressed as a power function. 
Any constant C is also Cx^0 since any term raised to the 0th power is 1. Hence, the 
slope of any constant C is 0. The linear function 2x can also be expressed as 2x^1 
and hence its slope is 1(2)x^(1−1) or 2. 

The slope of this ‘slope function’ is the derivative of a derivative, called the 
second derivative, which is designated as d^2f(x)/dx^2. 

d^2f/dx^2 = d(df/dx)/dx = d[bax^(b−1)]/dx = a(b)(b − 1)x^(b−2). And so on 
for the nth derivative. 

The slope of a function that is the sum of multiple terms is found by replacing 
each term with its derivative. For example, the slope of 7 + 4x^1.5 is 0 + 6x^0.5. 
This example illustrates the fact that the slopes of functions containing constants 
are not affected by the constants. Marginal costs are not impacted by fixed costs. 
Derivatives of constants, including fixed costs, are always 0. 
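
The two-step rule can be written as a few lines of code. This is a minimal sketch, 
assuming a function is stored as a list of (a, b) pairs, each standing for a term 
ax^b; that representation and the function name are choices made here for 
illustration only. 

```python
def differentiate(terms):
    """Apply the power rule to a list of (a, b) pairs, each meaning a*x**b."""
    result = []
    for a, b in terms:
        if b == 0:
            continue                       # the slope of a constant is 0
        result.append((a * b, b - 1))      # multiply by exponent, subtract 1
    return result


# 7 + 4x^1.5: the constant drops out, leaving 6x^0.5
print(differentiate([(7, 0), (4, 1.5)]))
```

Applying the function twice gives the second derivative, and so on for higher 
derivatives. 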

There are other shortcuts to differentiating more complicated combinations of 
functions that one can learn from textbooks in calculus. Probably the biggest short- 
cut one can take to find a derivative is to access one of many programs available 
on the internet for differentiating user-provided functions. 

Before leaving this subject, we need to cover what is termed partial differenti- 
ation of multivariable functions. 


10.5 Partial Differentiation 


For multivariate functions, i.e., functions having more than one unknown variable, 
one can find the slopes associated with each variable independently of the others. 
For example, consider the function f(x, y) = 5 + 3(xy). The partial derivative of 
f(x, y) with respect to x (assuming y is a constant) is ∂f/∂x = 3y. The partial 
derivative of f(x, y) with respect to y (assuming x is constant) is ∂f/∂y = 3x. 

For partial derivatives, we replace the differential operator d as in dx with ∂ as 
in ∂x to indicate that it is a partial differentiation. 

To illustrate, consider the two-variable function f(x, y) = 5 + 3(xy)^2, which is 
the same as 5 + 3(x^2 y^2). 


∂f/∂x = 3(2x y^2) = 6x y^2. Partial derivative with respect to the variable x. 
∂f/∂y = 3(2x^2 y) = 6x^2 y. Partial derivative with respect to the variable y. 
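
These partial derivatives can be checked numerically by changing one variable 
slightly while holding the other fixed. The sketch below assumes a 
central-difference estimate with a small step h; the helper names and the test 
point (x = 2, y = 3) are arbitrary choices for illustration. 

```python
def f(x, y):
    return 5 + 3 * (x * y) ** 2


def partial_x(func, x, y, h=1e-6):
    """Slope with respect to x, holding y constant."""
    return (func(x + h, y) - func(x - h, y)) / (2 * h)


def partial_y(func, x, y, h=1e-6):
    """Slope with respect to y, holding x constant."""
    return (func(x, y + h) - func(x, y - h)) / (2 * h)


x, y = 2.0, 3.0
print(partial_x(f, x, y), 6 * x * y ** 2)   # numerical estimate vs 6xy^2
print(partial_y(f, x, y), 6 * x ** 2 * y)   # numerical estimate vs 6x^2y
```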


10.6 A Review 

For a review, assume f(x) = 9 + 3x^−2 + 5x^4. 


df/dx = −6x^−3 + 20x^3 First derivative. 
d^2f/dx^2 = 18x^−4 + 60x^2 Second derivative. 
d^3f/dx^3 = −72x^−5 + 120x Third derivative. 


Finally consider f(x, y) = 5 + 3(x + y)^2, which is the same as 5 + 3(x^2 + 2xy 
+ y^2). 


∂f/∂x = 3(2)(x + y)^(2−1) = 6(x + y) Partial derivative with respect to the variable x. 
∂f/∂y = 3(2)(x + y)^(2−1) = 6(x + y) Partial derivative with respect to the variable y. 


10.7 Derivative Notation 


See Table 10.1 


Table 10.1 Differential calculus notation. The variable y = f(x) 

Symbol       Name                     Meaning                                  Example 
dy/dx        Derivative               Derivative, Leibniz’s notation           d(3x^3)/dx = 9x^2 
d^2y/dx^2    Second derivative        Derivative of derivative                 d^2(3x^3)/dx^2 = 18x 
d^ny/dx^n    nth derivative           n times derivation 
ẏ            Time derivative          Derivative by time, Newton’s notation 
ÿ            Time second derivative   Derivative of derivative 
∂f/∂x        Partial derivative                                                ∂(x^2 + y^2)/∂x = 2x 


10.8 Integration 


Integration is just the reverse of differentiation. Differentiating a function gives us 
the equation for the slope of that function. For example, the slope of the function 
x^2 is d(x^2)/dx = 2x. Integrating a function finds the original function for which 
the existing function, e.g., 2x, is its slope. The process of integration is simply the 
reverse of what it is for differentiation, with an addition of a constant. 

To differentiate x^2, we first multiply the function by its exponent, 2x^2, and then 
subtract 1 from the exponent, to get 2x. To integrate 2x, which is 2x^1, we first 
add one to the exponent, 2x^(1+1) = 2x^2. Then we divide the function by the new 
exponent, getting 2x^2/2 = x^2. But we also need to add a constant, say C, which is 
the value of the original function when x = 0. For the function x^2, C is obviously 
0. So we end up with x^2 + C, and when this is differentiated it becomes 2x. 
Differentiating C + 5x^3 results in 0 + 15x^2. Integrating 15x^2 results in 
15x^(2+1)/(2 + 1) = 5x^3 plus a constant C. 


10.8.1 An Exception 


Consider integrating ax^−1 or equivalently a/x. In this case, the rule would give 
ax^0/0 = a/0 since the exponent b = −1. Hence, in this case, the rule for 
integrating power functions does not work. The correct integral is the constant ‘a’ 
times the natural logarithm of x plus a constant C (a ln x + C). The term ln x is 
the exponent of the base of natural logarithms, e (= 2.718281828...), that results 
in x. Note e^(ln x) = x. When x = 1, ln 1 = 0 since e^0 = 1. When x is e, ln e = 1 
since e^1 = e. 

If we were working with logarithms of base 10, then 10^(log x) = x. The log of 
1 is 0 since 10^0 is 1. The log of 10 is 1 since 10^1 is 10, and the log of 100 is 
2 since 10^2 is 100. Again, the logarithm of some number x is the exponent of, in 
this case, 10, which results in the value x. The natural logarithm is the exponent 
of e that results in some value of x. The base of logarithms can be either 10 (when 
the term ‘log’ is used) or e (when the term ‘ln’ is used). 


10.8.2 What is Integration? 


The upper case sigma, Σ, signifies a sum. If we were adding a series of discrete 
values of some function g(xj) we would write it as Σj g(xj). Whatever those values 
are, they can be expressed as areas g(xj)Δxj, where the function g(xj) is constant 
over the interval Δxj. These discrete values can be considered rectangles having 
heights equal to g(xj) and widths equal to Δxj, such as shown in Fig. 10.4. 

The sum of the areas of the rectangles shown in Fig. 10.4 is an approximation 
of the area under the continuous function 2x. Since the function 2x is a continuous 
function of x, the smaller the widths Δx are, the more accurate will be the 
estimate of the area under the function. This is evident when computing the area 
under the function shown in Fig. 10.5. 

Fig. 10.4 A series of discrete rectangles having heights g(x) and widths of Δx for 
discrete values of x 


Fig. 10.5 Computing the area under a function becomes more accurate the smaller the 
width of each rectangle becomes 


Assuming all Δx are 1, the area of each rectangle in Fig. 10.4 is 2x. The sum 
of the areas over each value of x from 1 to 5 is expressed as 


Σ(x = 1 to 5) 2x Δx = 2 + 4 + 6 + 8 + 10 = 30. 


As Δx gets smaller, the sum of the rectangle areas between 0 and 5 converges to 
the true area of 0.5(5)(10) = 25. As Δx approaches 0, it becomes dx, and the 
integral sign, ∫, replaces the Σ sign. Hence, the area under the function 
g(x) = 2x from x = 0 to x = 5 is 


∫(0 to 5) 2x dx = 5^2 = 25. 
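
The convergence of the rectangle sum to the true area can be seen with a short 
computation. The sketch below assumes each rectangle takes its height from the 
function value at its right-hand edge, which reproduces the sum of 30 when Δx = 1; 
the function name and the numbers of rectangles tried are illustrative choices. 

```python
def riemann_sum(g, a, b, n):
    """Sum n rectangles of width (b - a)/n, each as tall as g at its right edge."""
    dx = (b - a) / n
    return sum(g(a + (j + 1) * dx) * dx for j in range(n))


def g(x):
    return 2 * x


# The sum starts at 30 for five rectangles and approaches 25 as dx shrinks.
for n in (5, 50, 500, 5000):
    print(n, riemann_sum(g, 0, 5, n))
```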


If g(x) is the function that defines the slope of another function f(x), then the 
equation defining the area under the slope function is the function f(x). Hence, if 
f(x) = x^2, then its slope d(x^2)/dx is 2x. The triangular area from x = 0 to some 
value of x = x* under the function 2x is 0.5(x*)(2x*) = (x*)^2. The area 
under a slope function is the value of the original function: ∫(df(x)/dx) dx = f(x). 


10.8.3 Integrating Over Ranges of a Variable or Function 


∫(15x^2) dx = 5x^3 + C is an example of indefinite integration. The value of x has 
no limits. 

If x ranges between a and b, then the area under any continuous function g(x) 
between x = a and x = b is determined by the definite integral 


∫(a to b) g(x) dx = ∫g(x) dx evaluated at x = b − ∫g(x) dx evaluated at x = a 


Thus, for g(x) = 2x, 


[x^2 + C] at x = b − [x^2 + C] at x = a = b^2 − a^2. 


10.8.4 Other Examples of Integration 


Some functions may have multiple terms of the form ax^b. In this case, integrating 
each one separately will result in the integral of the entire function. For example, 
assume f(x) is (5 + 3x − 2x^2)^2. When expanded it becomes 25 + 30x − 11x^2 − 
12x^3 + 4x^4. Differentiating each term of f(x) results in 2(5 + 3x − 2x^2)(3 − 4x) 
or 30 − 22x − 36x^2 + 16x^3. 

Integrating the function (30 − 22x − 36x^2 + 16x^3) involves integrating each 
term. 


∫(30 − 22x − 36x^2 + 16x^3) dx = 30x − 11x^2 − 12x^3 + 4x^4 + C. 


The constant C can be determined by referring to the original function f(x) = 
(5 + 3x − 2x^2)^2 and setting x to 0. This identifies C to be 5^2 or 25. Thus, the 
integral of a differentiated function df(x)/dx is the function f(x) itself. 
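
The expansion, differentiation, and integration in this example can be reproduced 
term by term. This is a sketch only, assuming each polynomial is stored as a list 
of coefficients with the lowest power first (so 5 + 3x − 2x^2 is [5, 3, -2]); the 
helper names are hypothetical. 

```python
def poly_multiply(p, q):
    """Multiply two polynomials given as coefficient lists, lowest power first."""
    product = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            product[i + j] += a * b
    return product


def poly_differentiate(p):
    """Power rule on each term: c*x^k becomes k*c*x^(k-1)."""
    return [k * c for k, c in enumerate(p)][1:]


def poly_integrate(p, constant):
    """Reverse power rule on each term, plus the constant of integration."""
    return [constant] + [c / (k + 1) for k, c in enumerate(p)]


base = [5, 3, -2]                          # 5 + 3x - 2x^2
f = poly_multiply(base, base)              # expanded f(x)
slope = poly_differentiate(f)              # df/dx
recovered = poly_integrate(slope, f[0])    # C = f(0)

print(f)
print(slope)
print(recovered)
```

Integrating the slope and adding C = f(0) returns the coefficients of the original 
expanded function. 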

There are many functions that do not easily convert to a series of terms that are 
easily integrated. The internet not only provides many examples of differentiation 
and integration, but also contains programs that will do the differentiation or inte- 
gration of user-provided functions. So, if you are stuck, go to the internet but you 
should be able to perform those operations on the types of functions illustrated in 
this chapter. 

The tables below are for your reference if needed (Tables 10.2, 10.3 and 10.4). 

Table 10.2 Notation used for integration 

∫     integral           Integration involving one variable 
∬     double integral    Integration involving two variables 


Table 10.3 Some common indefinite integrals. The ‘ln’ in this table refers to natural 
logarithms having e as its base 

∫ax^b dx = ax^(b+1)/(b + 1) + C 
∫a(x^−1) dx = a ln|x| + C 
∫a(b + cx)^−1 dx = (a/c) ln|b + cx| + C 
∫a(b + cx)^n dx = a(b + cx)^(n+1)/(c(n + 1)) + C, for n ≠ −1 


Table 10.4 Some rules satisfied by definite integrals 

1  Order of integration    ∫(b to a) f(x) dx = −∫(a to b) f(x) dx                   A definition 
2  Zero width interval     ∫(a to a) f(x) dx = 0                                     A definition when f(a) exists 
3  Constant multiple       ∫(a to b) k f(x) dx = k ∫(a to b) f(x) dx                 Any constant k 
4  Sum and difference      ∫(a to b) (f(x) ± g(x)) dx = ∫(a to b) f(x) dx ± ∫(a to b) g(x) dx 
5  Additivity              ∫(a to b) f(x) dx + ∫(b to c) f(x) dx = ∫(a to c) f(x) dx 

Exercises 
0 Warmup. 


The following examples show that if you want to compute the average value of 
a function over a range of values, you want to compute the average of different 
functional values rather than computing the function’s value of the average input 
value. 

Consider each of these functions: 

[Figure: the concave function 10X − X^2, the linear function 5X, and the convex 
function X^2 plotted for X from 0 to 5; at X = 2.5 their values are 18.75, 12.5, 
and 6.25] 

X                      10X − X^2    5X       X^2 
0                      0            0        0 
1                      9            5        1 
2                      16           10       4 
3                      21           15       9 
4                      24           20       16 
5                      25           25       25 
Sum/6        15/6      95/6         75/6     55/6 
Arithmetic 
mean, AM     2.5       15 5/6       12.5     9 1/6 

Note that: 


For concave functions: 
Mean of function values < function value for mean x 
15 5/6 < 10(2.5) − 2.5^2 = 18.75 
For convex functions: 
Mean of function values > function value for mean x 
9 1/6 > 2.5^2 = 6.25 
For linear functions: 
Mean of function values = function value for mean x 
12.5 = (5)2.5 = 12.5 


Show that the true mean is between these two values for each function. 

1. Benefit-cost analysis. 
Assume a benefit function B = 60*x^0.8 and a cost function C = 4 + 7*x^1.5. 
The difference between B and C is the net benefits. 
(a) Find the value of x that results in the maximum net benefits. 
(b) Would an increase in the fixed cost of 4 affect the value of x? 

2. Water supply utility. 
You are a mayor of a town that is considering privatizing the public water supply 
system. Currently, the public water supply system is operating in such a way that 
maximizes the benefits to its consumers (willingness to pay) while still paying for 
the service. No profit is made. If it is privatized, the private company will want 
to maximize its profits (revenue less costs). 

For example, consider the functions shown below: 

The horizontal axis is the amount of water delivered, and the vertical axis is 
money representing the unit price of water charged, the total and marginal costs, 
and the total and marginal revenue. 

Willingness to pay is the area under the demand curve. 

Assume the public utility objective is to maximize willingness to pay less the 
cost of supplying water. 

Assume the private utility objective is to maximize total revenue less the cost 
of supplying water. 

The total revenue is the unit price times the quantity Q sold. 

[Figure, two panels. Public Utility: demand function Q = f(P), with unit price P 
and unit cost C plotted against water quantity Q, showing total consumer benefit 
(“consumer surplus”), total cost, and the point (Qpub, Ppub). Private Utility: the 
same demand function with the marginal revenue curve, showing consumer surplus 
(benefit), profit (“producer’s surplus”), total cost, and the quantities Qpri and 
Qpub with price Ppri.] 

For an amount of water Q, assume the total cost = 5Q and the demand function 
= unit price = 12 − 1.5Q. 


[Figure: total cost and unit price plotted against quantity Q] 


Given these data, find the best amounts of water to deliver and the associated 
unit prices to charge for both the public and the private utility. The public 
utility should maximize consumer surplus less its costs, and the private utility 
will maximize its producer surplus or profit subject to any regulations it must 
meet. In this example, there are none. 

Find the solutions and graph the solutions like the figures above. Identify on 
the graph the consumer’s surplus, producer’s surplus, and total cost. 

For a public utility, what should the unit price be for the water supplied, and 
how does it compare to the marginal cost? 

For a private utility, what should the unit price be for the water supplied, and 
how does it compare to the marginal cost? Hence, what is the unit and total profit? 


11 Lagrangian Models 


ABSTRACT 


Lagrangian models use calculus to solve multi-variable non-linear constrained 
optimization models and to identify the marginal changes (‘shadow prices’) in the 
optimal objective value that result from changes in constraint bounds. This 
is especially useful when the constraints represent resource limitations. 


11.1 Introduction 


Joseph-Louis Lagrange is usually considered to be a French mathematician, but 
the Italian Encyclopedia refers to him as an Italian mathematician who lived from 
1736-1813. Among numerous other honors, a street, Rue Lagrange, in Paris, is 
named after him (Fig. 11.1). 

Joseph-Louis Lagrange is famous for lots of things he did in his life, but for us, 
he showed how differential calculus could be used to find solutions to constrained 
non-linear models. But, since it is calculus-based, all the limitations of calculus 
apply. It is limited to continuous functions. It produces a maximum for objective 
functions that are concave and a minimum for objectives that are convex. It ignores 
constants. But it gives us an opportunity to better understand the concept and find 
the values of shadow prices. 


11.2 Constructing Lagrangian Optimization Models 


Lagrange’s approach for finding the maximum or minimum value of some objective 
function F(X) and associated values of all the decision variables X = {x1, 
x2, x3, ..., xj, ..., xn} also determines what economists call ‘shadow prices,’ and 
operations researchers call ‘dual variables’ or ‘dual prices,’ associated with each 
constraint, gi(X) = bi. Each constraint’s shadow price is the change in the value of 
the objective function F(X) given a unit change in the constraint’s bi value. These 
shadow prices are also called Lagrangian multipliers, typically denoted as λi. 


λi = ∂F(X)/∂bi for each constraint i. 


Fig. 11.1 Joseph-Louis Lagrange and a street in Paris having his name. 
https://en.wikipedia.org/wiki/Joseph-Louis_Lagrange (Creative Commons 
Attribution-ShareAlike License) 


The modeling approach involves combining the objective function F(X) and all 
the constraints expressed as equalities, gi(X) = bi, into a single function L(X, λ). 
The unknown variables are the original ones contained in the vector X and all the 
Lagrange multiplier variables, λi, one for each constraint i. 

Each constraint gi(X) = bi in L(X, λ) is multiplied by its Lagrangian multiplier 
λi. Their sum is subtracted from F(X). The result is 


L(X, λ) = F(X) − Σi λi(gi(X) − bi). 


Converting inequality constraints originally of the form gi(X) ≤ bi or gi(X) ≥ bi 
to equalities, when one is not sure whether they will be binding, may involve the 
addition, or subtraction as appropriate, of the square of an additional unknown 
slack or surplus variable. For this discussion, assume such variables, if needed, 
are included in the vector X. These so-called slack or surplus variables are 
squared to ensure that the amount added or subtracted is non-negative. 

Equating to 0 each of the partial derivatives of L(X, λ) with respect to each 
of the unknown variables in X and λ results in a set of equations that, when 
simultaneously solved, will identify the values of each of the unknown variables 
in X and shadow prices in λ that maximize or minimize F(X). The procedure is 
the same whether the objective function F(X) is to be maximized or minimized. 
Therefore, one should check to see if the solution is a maximum or minimum 
value. Again, the λi values are the shadow prices associated with the bi values of 
each constraint i. 

L(X, λ) = F(X) − Σi λi(gi(X) − bi) 
∂L/∂xj = 0 = ∂F/∂xj − Σi λi ∂(gi(X))/∂xj for all variables xj 
∂L/∂λi = 0 = (gi(X) − bi) for all constraints i. 


Before showing some specific examples, consider the constraints where a surplus 
or slack variable, xi^2, had to be added or subtracted from the left-hand side 
of a constraint to form an equality. When the partial derivative of L(X, λ) with 
respect to that variable is set to 0 the result is 


∂L/∂xi = 0 = −2 xi λi. 


Note that either xi or λi or both will equal 0. If the constraint is binding there 
will be no inequality, and the value of the slack or surplus variable xi will be 0. If 
the constraint is not binding (does not affect the values of the other variables xj) 
then λi will equal 0. There will be no change in F(X) given a small change in bi. 


11.3 Example Lagrangian Models 


Consider finding the minimum length of fence needed to enclose a rectangular 
area of at least A or of finding the maximum rectangular area that can be enclosed 
by a fence of length P. The area’s perimeter = 2(length) + 2(width). Clearly, the 
solution to the first problem is length = width = √A. Solving a Lagrangian model 
will also identify the Lagrangian multipliers, i.e., the shadow prices, associated 
with the available resource, i.e., area or length of fencing. Letting l and w be 
the unknown length and width of the rectangular area A, 


L = 2(l + w) − λ(lw − A) 
∂L/∂l = 0 = 2 − λw 
∂L/∂w = 0 = 2 − λl 
∂L/∂λ = 0 = lw − A. 


One can see from the first two equations that l = w; thus from the third 
equation, l and w = √A. Hence, the shadow price associated with the area, λ, is 
2/√A. This is how much more fencing is required for a unit increase in A. 

Alternatively, if the total length of fencing is P, we can find the maximum area 
A (lw) enclosed by P 


L = lw − λ(2(l + w) − P) 
∂L/∂l = 0 = w − 2λ 
∂L/∂w = 0 = l − 2λ 
∂L/∂λ = 0 = 2(l + w) − P. 


Again, from the first two equations, l = w, and from the third, l or w equals 
P/4. Hence the shadow price associated with the perimeter P, λ, equals P/8. This 
is how much more area is obtained for a unit increase in P. 
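
The interpretation of these multipliers as shadow prices can be checked 
numerically. The sketch below assumes the optimal solutions found above (a square 
in both cases) and estimates the change in the optimal objective value for a small 
change in the resource by a finite difference; the function names and the values 
A = 100 and P = 40 are arbitrary illustrations. 

```python
import math


def min_fence(area):
    """Minimum perimeter enclosing `area`: a square with side sqrt(area)."""
    return 4 * math.sqrt(area)


def max_area(perimeter):
    """Maximum area enclosed by `perimeter`: a square with side perimeter/4."""
    return (perimeter / 4) ** 2


def marginal_change(func, value, delta=1e-6):
    """Finite-difference estimate of the change in func per unit change in value."""
    return (func(value + delta) - func(value - delta)) / (2 * delta)


A, P = 100.0, 40.0
print(marginal_change(min_fence, A), 2 / math.sqrt(A))   # compare with 2/sqrt(A)
print(marginal_change(max_area, P), P / 8)               # compare with P/8
```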

This model can be extended to one of finding the least-cost dimensions of a 
storage tank containing a volume of V. Let the average cost per unit of the base 
area = Cb. Similarly, let the average cost per unit side area = Cs and the average 
cost per unit top area = Ct. Before we can model this tank, we need to decide 
on its shape. Here we can consider two different tank shapes, a rectangular and 
a cylindrical one. Of course, it is possible to pick anything in between these two 
shapes. The stated objective is to 


Minimize Total cost 
Subject to: Total cost = base cost + side cost + top cost 
            Volume of tank ≥ required volume 


Assuming a rectangular tank having dimensions of L, W, and H: 
Base cost = Cb L W 
Side cost = Cs 2H(L + W) 
Top cost = Ct L W 
L W H ≥ V. 


Assuming a cylindrical tank having dimensions R and H: 
Base cost = Cb(πR^2) 
Side cost = Cs(2πRH) 
Top cost = Ct(πR^2) 
πR^2 H ≥ V. 


Clearly, for each of the two volume capacity constraints, the least-cost solution 
will result in an equality. Constructing a bigger tank than required just costs more. 

Using Lagrangian models. For a rectangular tank with dimensions l, w, and h: 


L = 2Cs h(l + w) + (Cb + Ct)lw − λ(lwh − V) 
∂L/∂l = 0 = 2Cs h + (Cb + Ct)w − λwh 
∂L/∂w = 0 = 2Cs h + (Cb + Ct)l − λlh 
∂L/∂h = 0 = 2Cs(l + w) − λlw 
∂L/∂λ = 0 = lwh − V 

From these first three partial differential equations, one can prove that the width
$w$ equals the length $l$, and that both equal $2\,Cs\,h/(Cb + Ct)$. Since from the last
equation, $h = V/w^2$, substituting that into $w = l = 2\,Cs\,h/(Cb + Ct)$ yields


$$w = l = [2\,Cs\,V/(Cb + Ct)]^{1/3}$$
and
$$h = V/[2\,Cs\,V/(Cb + Ct)]^{2/3} \quad\text{or}\quad h = V^{1/3}\,[(Cb + Ct)/(2\,Cs)]^{2/3}.$$


The shadow price associated with volume V will denote the change in the total
cost per unit change in volume V. The total cost increases if V is increased. What
is interesting about all such tank or container problems is that the ratio of base
and top cost to total cost will equal 1/3 no matter what the tank shape and unit
costs and volumes are. The total side cost will always be two-thirds of the total cost
of a minimum-cost tank or container.

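For a circular tank with dimensions $r$ and $h$:
$$L = Cs\,2\pi r h + (Cb + Ct)\,\pi r^2 - \lambda [\pi r^2 h - V]$$
$$\partial L/\partial r = 0 = 2\,Cs\,h\pi + 2(Cb + Ct)\,\pi r - 2\lambda \pi r h$$
$$\partial L/\partial h = 0 = 2\,Cs\,\pi r - \lambda \pi r^2$$
$$\partial L/\partial \lambda = 0 = \pi r^2 h - V.$$


Use the first two partial differential equations to find that $r/h = Cs/(Cb + Ct)$.
Using the third equation


$$r = [V\,Cs/(\pi (Cb + Ct))]^{1/3}$$
$$h = [V (Cb + Ct)^2/(\pi\,Cs^2)]^{1/3}.$$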
Now look at cost ratios:


$$\text{Side cost}/\text{total cost} = Cs\,2\pi r h/[Cs\,2\pi r h + (Cb + Ct)\,\pi r^2]$$
$$= 2\,Cs\,h/[2\,Cs\,h + (Cb + Ct)\,r]$$
$$= 2\,Cs\,h/[2\,Cs\,h + (Cb + Ct)\,h\,Cs/(Cb + Ct)] = 2/3.$$

This is true regardless of unit costs Cs, Ct, Cb or V!
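The two-thirds result can be checked numerically for both tank shapes. The sketch below is illustrative only: the function names and the two sets of volumes and unit costs are assumptions, not values from the text. It evaluates the closed-form least-cost dimensions derived above and prints the side-cost share of total cost.

```python
import math

def rectangular_tank(V, Cb, Cs, Ct):
    """Least-cost rectangular tank (square base) from the closed-form solution."""
    w = (2.0 * Cs * V / (Cb + Ct)) ** (1.0 / 3.0)  # width equals length
    h = V / (w * w)                                # height from the volume constraint
    side_cost = 2.0 * Cs * h * (w + w)
    base_and_top_cost = (Cb + Ct) * w * w
    return w, h, side_cost, base_and_top_cost

def cylindrical_tank(V, Cb, Cs, Ct):
    """Least-cost cylindrical tank from the closed-form solution."""
    r = (V * Cs / (math.pi * (Cb + Ct))) ** (1.0 / 3.0)
    h = V / (math.pi * r * r)                      # height from the volume constraint
    side_cost = Cs * 2.0 * math.pi * r * h
    base_and_top_cost = (Cb + Ct) * math.pi * r * r
    return r, h, side_cost, base_and_top_cost

# Assumed sample data: (V, Cb, Cs, Ct)
cases = [(100.0, 3.0, 2.0, 1.0), (500.0, 10.0, 4.0, 6.0)]

ratios = []
for V, Cb, Cs, Ct in cases:
    for tank in (rectangular_tank, cylindrical_tank):
        _, _, side_cost, base_and_top_cost = tank(V, Cb, Cs, Ct)
        ratio = side_cost / (side_cost + base_and_top_cost)
        ratios.append(ratio)
        print(tank.__name__, "V =", V, "side cost share =", round(ratio, 6))
```

Changing the assumed volumes or unit costs changes the dimensions but should leave the printed share at two-thirds.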

Finally, consider the resource allocation problem. 

Assume the goal is to maximize the total benefits derived from the allocation of
resources to three users. Denote the allocations as X, Y, and Z to users 1, 2, and 3,
respectively. The benefits obtained from each allocation are $6X - X^2$, $7Y - 1.5Y^2$,
and $8Z - 0.5Z^2$. Assuming that only 6 resources are available, the problem is to
find the allocations that


$$\text{Maximize } F(X, Y, Z) = (6X - X^2) + (7Y - 1.5Y^2) + (8Z - 0.5Z^2)$$
$$\text{Subject to: } X + Y + Z = 6.$$


The resource constraint is an equality since more resources are desired, and 
therefore, all 6 resources will be allocated. 

The marginal benefits (slopes of the benefit functions) associated with each
respective user are $6 - 2X$, $7 - 3Y$, and $8 - Z$. When these slopes equal each other,
the corresponding allocations will maximize the total benefits. This can be shown
by just constructing a Lagrangian model.

The Lagrangian equation can be written as


$$L = 6X - X^2 + 7Y - 1.5Y^2 + 8Z - 0.5Z^2 - \lambda (X + Y + Z - 6).$$


Differentiating with respect to each unknown ($X$, $Y$, $Z$, $\lambda$) and setting the result
to 0


$$\partial L/\partial X = 0 = 6 - 2X - \lambda$$
$$\partial L/\partial Y = 0 = 7 - 3Y - \lambda$$
$$\partial L/\partial Z = 0 = 8 - Z - \lambda$$
$$\partial L/\partial \lambda = 0 = X + Y + Z - 6$$


The first three partial differential equations show that the slopes of each benefit
function, the marginal benefits, at the optimal solution, are all the same, and equal
to $\lambda$. Using this information in the last equation, $X = 1$, $Y = 1$, $Z = 4$, and thus
the shadow price, $dF/d6 = \lambda = 4$, and the total benefits are 34.5. If the available
resources were 6.1, the total benefits would be 34.9, an increase of about
$0.1\lambda = 0.4$.
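These numbers can be reproduced with a few lines of Python. This is a sketch under the assumption that all three allocations are positive at the optimum, so the first-order conditions can be solved directly; the function name `allocate` is hypothetical.

```python
def allocate(total):
    """Solve 6 - 2X = 7 - 3Y = 8 - Z = lam together with X + Y + Z = total.

    From the first-order conditions: X = (6 - lam)/2, Y = (7 - lam)/3, Z = 8 - lam.
    Substituting into the resource constraint gives one linear equation in lam.
    """
    lam = (6.0 / 2.0 + 7.0 / 3.0 + 8.0 - total) / (1.0 / 2.0 + 1.0 / 3.0 + 1.0)
    x = (6.0 - lam) / 2.0
    y = (7.0 - lam) / 3.0
    z = 8.0 - lam
    benefit = (6 * x - x ** 2) + (7 * y - 1.5 * y ** 2) + (8 * z - 0.5 * z ** 2)
    return x, y, z, lam, benefit

x, y, z, lam, benefit = allocate(6.0)
print("X, Y, Z:", x, y, z, "shadow price:", lam, "total benefits:", benefit)

# A small increase in the available resources raises benefits by roughly lam * 0.1
_, _, _, _, benefit_more = allocate(6.1)
print("total benefits with 6.1 resources:", round(benefit_more, 1))
```

The difference between the two benefit totals is slightly less than 0.4 because the shadow price itself declines as more resources become available.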


Exercises 


1. Benefit Cost analysis 


Assume a benefit function B = 60*x^0.8 and a cost function C = 4 + 7*x^1.5. 
The maximum difference between B and C, the maximum net benefits, occurs at 
x = 8.7686. 

(a) Would an increase in the fixed cost of 4 affect the value of x? 

(b) Use a Lagrangian model to find the value of the shadow price, or Lagrangian 
multiplier, if x cannot exceed 5. What does the multiplier signify? 

2. Allocating resources 

(a) Consider the problem of allocating resources to three users. The allocations 
are X, Y, and Z. User 1’s total revenue is 6X/(1 + X). User 2’s total revenue is 
7Y/(1 + Y). User 3’s total revenue is 8Z/(1 + Z). Assume 10 resources are 
available. 

Show how to find the allocations that maximize the total revenue from all 
three users, and the associated shadow price of the resource constraint, using 
Lagrange multipliers. Compare that solution with one obtained from solving 
the model itself, say using Solver in Excel. 

(b) There are two users of resources, A and B, whose income depends on the 
resources they are allocated. Let those allocations be A and B, respectively. 
The income to user A equals 10A - 0.5A^2. The income to user B is
5B - 0.25B^2. You wish to know what allocations result in the maximum total
income. You only have 14 resources to allocate and are curious what marginal
increase in total income could result if you had more resources.


ABSTRACT 


This chapter introduces how probability and statistical measures can be incorpo- 
rated into models to reflect the uncertainties of model inputs, parameter values 
and output variable values. 


12.1 Introduction 


When the value of a model variable or parameter can vary and is not predictable, 
we often call it random. If we observe many outcomes or values of that variable or 
parameter, we can estimate its probability distribution along with various statistical 
measures, such as its mean, median, and variance, that characterize the probability 
distribution. 

Assume the variable X is a random variable. In this chapter and the following 
chapter, uppercase letters, e.g., X, will denote random variables and lowercase 
letters, e.g., x, will represent the values of that random variable X. 

There are two types of random variables, discrete and continuous. 


12.2 Discrete Random Variables 


A discrete random variable is one that may take on a finite (or countable) number of
discrete values such as integers. An example is the outcome of a toss of a six-sided die.
Possible outcomes are 1, 2, 3, 4, 5, and 6. Examples of other discrete random 
variables include the number of people who visit the public library on Mondays, 
the number of cars parked in a city garage at any given time during the working 
day, the number of rainy days in July, etc. 

The probability distribution of a discrete random variable is a plot of the
probabilities associated with each of its possible values. This histogram is
also sometimes called the probability function or the probability mass function
(Figs. 12.1 and 12.2).

Fig. 12.1 Probability distribution of discrete possible outcomes of a random variable

Fig. 12.2 The cumulative (red) and exceedance (blue) distributions of the discrete
random variable distribution shown in Fig. 12.1

Suppose the outcome of a random variable X may be any of n different discrete
values $x_i$, with the probability $p_i$ that $X = x_i$. Thus, $\Pr(X = x_i) = p_i$. These
probabilities $p_i$ must satisfy the following:


$$0 \le p_i \le 1 \text{ for each } i \quad\text{and}\quad p_1 + p_2 + \cdots + p_n = 1.$$


All random variables (discrete and continuous) have cumulative distribution
functions. A cumulative distribution function defines the probability that the value
of a random variable X is less than or equal to a given value x, over the range of
possible values x. For a discrete random variable, the cumulative distribution
function is found by summing up the probabilities from the lowest possible value of X
to any particular value $x_i$. This defines the probability of the random variable value
being less than or equal to $x_i$, written $\Pr\{X \le x_i\}$. Cumulative distribution function
values range from 0 to 1.

Subtracting the (red) cumulative distribution from 1 yields the (blue) probability
of exceedance function, Pr{X > x}. For a random variable that takes only non-negative
values, the blue area under this entire probability of exceedance function is the
mean value of the random variable X.
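As an illustration, the sketch below builds the probability mass function, cumulative distribution, and exceedance function for the six-sided die mentioned above, and checks that the area under the exceedance function equals the mean. The variable names are illustrative, and exact fractions are used only to avoid rounding.

```python
from fractions import Fraction

values = [1, 2, 3, 4, 5, 6]
pmf = {x: Fraction(1, 6) for x in values}  # each face is equally likely

# Cumulative distribution: Pr{X <= x}, a running sum of the probabilities
cdf = {}
running_total = Fraction(0)
for x in values:
    running_total += pmf[x]
    cdf[x] = running_total

# Exceedance: Pr{X > x} = 1 - Pr{X <= x}
exceedance = {x: 1 - cdf[x] for x in values}

# Mean as the probability-weighted sum of the outcomes
mean = sum(x * p for x, p in pmf.items())

# Area under Pr{X > x} for x >= 0: it equals 1 on [0, 1), then exceedance[k] on [k, k + 1)
area_under_exceedance = Fraction(1) + sum(exceedance[x] for x in values)

print("cdf:", {x: str(p) for x, p in cdf.items()})
print("mean:", mean, "area under exceedance:", area_under_exceedance)
```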
Fig. 12.3 A probability density function $f_X(x)$ for a continuous random variable X


12.3 Continuous Random Variables 


A continuous random variable is one having an infinite number of possible values 
between any two limits. Continuous random variables often represent measure- 
ments. Examples include measures of weather like the amount of rain, snow, or 
an average temperature in any given location within a given period of time, the 
maximum daily noise level in a city or at an airport, the time it takes to travel from 
one location to another, the concentration of salt in a river, the height or weight of 
persons in any group of people, etc. 

For a continuous random variable, the probability of an outcome being some 
specific value is 0. For example, the probability of finding someone exactly 6 
feet tall is 0. Hence, continuous probabilities are defined over intervals of values, 
and represent the area under the probability distribution function, called a density 
distribution, within those intervals, such as between a and b in Fig. 12.3. 

A continuous probability function, $f_X(x)$, must be non-negative, i.e., $f_X(x) \ge 0$
for all x, and have a total area under the function of 1.


$$\int_{-\infty}^{\infty} f_X(x)\,dx = 1$$


A function $f_X(x)$ meeting these requirements is known as a probability density
function of the random variable X. The subscript denotes the random variable, in
this case, X, and the argument x represents a particular value of X. Several such
functions are shown in Fig. 12.4.

Probability distributions of random variables have a number of statistical 
characteristics. Some common ones are described below. 


12.4 Mean 


The mean of a discrete random variable X is a weighted average of the possible
values that the random variable can have. If the values $x_i$ of each of n observations
i are equally likely then the probability $p_i$ of each value $x_i$ is 1/n. This applies to
the uniform distribution shown in Fig. 12.4. In this case, the mean is the sum of
all n observations of X divided by n. In all cases, the arithmetic mean of a random
variable is the sum of each possible outcome times its probability. The common
symbol for the mean (also known as the expected value of X) is $\mu_X$ if based on
the entire population of random outcomes, otherwise, it is denoted as E(X).

Fig. 12.4 Different types of continuous probability density distributions (panel
labels include Log Normal, Triangular, Exponential, and Beta)


$$\mu_X \text{ or } E(X) = \sum_i x_i\,p_i \quad\text{for a discrete distribution } \Pr(x).$$


$$\mu_X \text{ or } E(X) = \int x\,f_X(x)\,dx \quad\text{for a continuous distribution } f_X(x).$$


Note that $f_X(x)\,dx$ is equivalent to $p_i$. It is the area (height times width) under
the probability distribution function: $f_X(x)$, the height, times the infinitely small
width interval $dx$.

The mean of a random variable X is the expected average outcome over many 
observations. The mean is not necessarily the most likely outcome, however. Con- 
sider a game in which you have a 90% chance of doubling your money every 
time you play it. Otherwise, you lose all your past winnings. The more times it’s 
played, the higher the expected winnings and the higher the probability that you 
will have nothing. 
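The game can be made concrete with a short calculation. The sketch below assumes a starting stake of 1 and that every play puts the entire holdings at risk; both are assumptions for illustration, as is the function name. After n plays the holdings are 2^n with probability 0.9^n and zero otherwise.

```python
def game_summary(n_plays, stake=1.0, p_win=0.9):
    """Expected holdings and probability of having nothing after n_plays."""
    p_still_winning = p_win ** n_plays
    expected_holdings = stake * (2 ** n_plays) * p_still_winning
    p_nothing = 1.0 - p_still_winning
    return expected_holdings, p_nothing

for n in (1, 5, 10, 20, 50):
    expected_holdings, p_nothing = game_summary(n)
    print(n, "plays: expected holdings =", round(expected_holdings, 2),
          " Pr(nothing) =", round(p_nothing, 4))
```

The expected holdings grow as 1.8^n while the probability of ending with nothing approaches 1, so for a long game the mean is far from the most likely outcome.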


12.5 Variance 


The variance of a probability distribution is a measure of its spread, or variability.
It is defined by the sum of each of the squared differences between the mean and
possible $x_i$ or $x$ values, times their associated probabilities, $p_i$ or $f_X(x)\,dx$. Again,
if it is based on the entire population of data, it is commonly denoted by $\sigma_X^2$ or
var(X), otherwise by $S_X^2$.


$$\sigma_X^2 \text{ or } \mathrm{var}(X) \text{ or } S_X^2 = \sum_i (x_i - \mu_X)^2\,p_i$$


$$\sigma_X^2 \text{ or } \mathrm{var}(X) \text{ or } S_X^2 = \int (x - \mu_X)^2 f_X(x)\,dx.$$


The standard deviation, $\sigma_X$ or std(X) or $S_X$, is the square root of the variance.
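For a discrete distribution these definitions translate directly into sums. The sketch below applies them to the fair six-sided die used earlier as an example; the variable names are illustrative.

```python
from fractions import Fraction
import math

# Probability mass function of a fair six-sided die
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

# Mean: sum of each outcome times its probability
mean = sum(x * p for x, p in pmf.items())

# Variance: sum of squared deviations from the mean, weighted by probability
variance = sum((x - mean) ** 2 * p for x, p in pmf.items())

# Standard deviation: square root of the variance
std_dev = math.sqrt(variance)

print("mean =", mean, " variance =", variance, " std =", round(std_dev, 4))
```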


12.6 Normal Distribution 


A normal distribution has a symmetrical bell-shaped density function centered 
about its mean, with its spread determined by its variance or standard deviation. 
The height (value) of a normal density distribution of the random variable X at a 
point x is given by the equation in Fig. 12.5. 

If a dataset is normally distributed, then about 68% of the observations will
fall within plus and minus one standard deviation, $\sigma$, of the mean, which, in this
standard case shown in Fig. 12.5, is within the interval $(-\sigma, \sigma)$. About 95% of
the observations will fall within plus and minus 2 standard deviations of the mean,
which is the interval $(-2\sigma, 2\sigma)$ for the standard normal. About 99.7% of the
observations will fall within 3 standard deviations $(-3\sigma, 3\sigma)$ of the mean. Although it
may appear as if a normal distribution does not include any values beyond a certain
interval, the density function is actually positive for all values of x from $-\infty$ to
$+\infty$. Data from any normal distribution may be transformed into data following
the standard normal distribution by subtracting the mean from the observation and
dividing by the standard deviation, i.e., $(x - \mu)/\sigma$.
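The 68%, 95%, and 99.7% figures can be reproduced from the standard normal cumulative distribution, which the Python standard library exposes through the error function `math.erf`. In the sketch below the function names and the sample observation (x = 130 from a normal distribution with mean 100 and standard deviation 15) are assumptions for illustration.

```python
import math

def standard_normal_cdf(z):
    """Pr{Z <= z} for a standard normal random variable Z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_within(k):
    """Probability of falling within k standard deviations of the mean."""
    return standard_normal_cdf(k) - standard_normal_cdf(-k)

def standardize(x, mean, std):
    """Transform an observation to its standard normal equivalent, (x - mean)/std."""
    return (x - mean) / std

for k in (1, 2, 3):
    print("within", k, "standard deviation(s):", round(prob_within(k), 4))

z = standardize(130.0, 100.0, 15.0)  # assumed sample observation
print("z =", z, " Pr{X <= 130} =", round(standard_normal_cdf(z), 4))
```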


Fig. 12.5 Standard normal probability distribution with mean $\mu = 0$ and standard
deviation of $\sigma = 1$. The normal density is
$f_X(x) = \frac{1}{\sigma\sqrt{2\pi}}\,e^{-\frac{1}{2}\left(\frac{x - \mu}{\sigma}\right)^2}$.
The percentages indicate the approximate percentage of the total area, 1, within each
segment of the distribution: about 3%, 13%, 34%, 34%, 13%, and 3% for the segments
bounded by $-2\sigma$, $-\sigma$, 0, $\sigma$, and $2\sigma$

Fig. 12.6 Distinguishing among the mode (most likely), median, and mean values of a
probability distribution. For a normal or other symmetric distribution, their values
are all the same


12.7 Median 


The median of a probability distribution is the value of the random variable that 
has a 50% chance of being exceeded. Half of the area of the distribution is to the 
left of the median and half is to the right. It is the value of the random variable 
whose cumulative probability is 0.5 (Fig. 12.6). 

For example, consider a continuous random variable X that ranges from 0 to
10 and whose triangular density function is $0.02x$. Its cumulative distribution is
the integral of $0.02x$, or $0.01x^2$, from 0 up to x = 10 and 1 for all values > 10. The
median is when this function equals 0.5. Hence, the median is $x = \sqrt{50}$, or about 7.07.
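When the cumulative distribution cannot be inverted by hand, the median can be found numerically. The sketch below applies a simple bisection search to the triangular example above; the function names and the tolerance are assumptions for illustration.

```python
import math

def cdf(x):
    """Cumulative distribution for the density 0.02x on 0 <= x <= 10."""
    if x <= 0.0:
        return 0.0
    if x >= 10.0:
        return 1.0
    return 0.01 * x * x

def quantile(prob, lo=0.0, hi=10.0, tol=1e-10):
    """Bisection search for the value x at which cdf(x) equals prob."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cdf(mid) < prob:
            lo = mid  # the target lies to the right of mid
        else:
            hi = mid  # the target lies at or to the left of mid
    return 0.5 * (lo + hi)

median = quantile(0.5)
print("median =", round(median, 6), " sqrt(50) =", round(math.sqrt(50.0), 6))
```

The same `quantile` function returns any other percentile of the distribution by changing the probability passed to it.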


12.8 Mode 


The most likely value, the mode, of a continuous or discrete probability distribution
is that which has the highest probability or, for a continuous distribution, the
highest probability density (Fig. 12.6).


12.9 Conditional and Joint Probabilities 


Consider two random events, such as the outside temperature in two successive 
days. Let them be denoted by A for the first day and B for the following day. 
Each has various intervals of outcomes and associated unconditional probabilities. 
However, the probability of a particular outcome of B on the second day may be 
dependent on the actual outcome of A on the first day. This conditional probability 
can be denoted as Pr(B|A), the probability of an outcome of B given an A outcome.
In the case where events A and B are independent (where event A has no effect
on the probability of event B), the conditional probability of event B given event
A is simply the probability of event B, Pr(B). Two successive coin tosses would
be an example of this. The first toss is A, and the second, B. The outcome of toss
A does not influence the outcome of toss B. Each possible outcome of each toss
has a probability of 0.5. The joint probability (the probability of both outcomes)
of two coin tosses, or any other independent events A and B, would be Pr(A, B) =
Pr(A)Pr(B).

If events A and B are not independent and the outcome of A influences that of
B, then the joint probability of two particular outcomes of A and B is defined by


Pr(A, B) = Pr(B|A) Pr(A).


From this definition, the conditional probabilities Pr(B|A) of each possible
outcome are easily obtained by dividing the joint probability Pr(A, B) by Pr(A):


Pr(B|A) = Pr(A, B)/Pr(A).

For example, assume both A and B are states of the temperature in two succes- 
sive days. Consider only two possible states: ‘hot’ and ‘cold’. Assume data show 
that at any time the probability A is cold = 0.6, and that B is cold = 0.7. Data also 
show that if A is cold, the probability that B is also cold = 0.9. If A is hot, then 
the probability B is cold = 0.4. Clearly, the probability of the state of B depends 
on the state of A. 

Summarizing: 

The unconditional probabilities: 


Pr(A is cold) = 0.6, thus Pr(A is hot) = 0.4, 
Pr(B is cold) = 0.7, thus Pr(B is hot) = 0.3, 


since if cold and hot are the only possible outcomes, these discrete probabilities 
must add to 1. 
The conditional probabilities: 


Pr(B is cold given A is cold) = 0.9, thus Pr(B is hot given A is cold) = 0.1. 
Pr(B is cold given A is hot) = 0.4, thus Pr(B is hot given A is hot) = 0.6. 


Using the fact that the joint probability of A and B, Pr(A, B) = Pr(B|A) Pr(A),
the joint probability of both A and B being cold = 0.9 (0.6) = 0.54. The joint
probability of both being hot, using the same equation, is 0.6 (0.4) = 0.24. The
joint probability of A being hot and B being cold is 0.4 (0.4) = 0.16. The joint
probability of B being hot and A being cold is 0.1 (0.6) = 0.06. The joint
probabilities of these four possible outcomes sum to 1.00.


Pr(A is cold, B is cold) = 0.9 (0.6) = 0.54, 
Pr(A is hot, B is hot) = 0.6 (0.4) = 0.24, 
Pr(A is hot, B is cold) = 0.4 (0.4) = 0.16, 
Pr(A is cold, B is hot) = 0.1 (0.6) = 0.06. 


Of interest may be the conditional probabilities Pr(A|B).

A method for calculating the conditional probabilities Pr(A|B) is by using
Bayes’ formula. The formula is based on the expression Pr(B) = [Pr(B|A is
cold)Pr(A is cold)] + [Pr(B|A is hot)Pr(A is hot)], which simply states that the
probability of a state of B, Pr(B), is the sum of the conditional probabilities of that
state of B given that A is cold or is not cold, each weighted by the probability of
that state of A. For independent events A and B, this
is equal to Pr(B)Pr(A is cold) + Pr(B)Pr(A is hot) = Pr(B)(Pr(A is cold) + Pr(A
is hot)) = Pr(B)(1) = Pr(B), since the probability of an event and its complement
must always sum to 1. Bayes’ formula is defined as follows:


Pr(A|B) = [Pr(B|A) Pr(A)] / [Pr(B|A is cold) Pr(A is cold) + Pr(B|A is hot) Pr(A is hot)]


Thus using the numerical example 


Pr(A is cold | B is cold) = 0.9(0.6)/[0.9(0.6) + 0.4(0.4)] = 54/70,

Pr(A is cold | B is hot) = 0.1(0.6)/[0.1(0.6) + 0.6(0.4)] = 6/30,

Pr(A is hot | B is cold) = 0.4(0.4)/[0.9(0.6) + 0.4(0.4)] = 16/70,

Pr(A is hot | B is hot) = 0.6(0.4)/[0.1(0.6) + 0.6(0.4)] = 24/30.

For each state of B, the two conditional probabilities of A sum to 1.0.
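The hot and cold example can be reproduced step by step: joint probabilities from Pr(B|A) Pr(A), the probabilities of B by summing over A, and Pr(A|B) by Bayes' formula. The sketch below uses exact fractions; the dictionary names are illustrative.

```python
from fractions import Fraction

states = ("cold", "hot")

# Unconditional probabilities of A
prob_A = {"cold": Fraction(6, 10), "hot": Fraction(4, 10)}

# Conditional probabilities Pr(B | A): outer key is the state of A, inner key the state of B
prob_B_given_A = {
    "cold": {"cold": Fraction(9, 10), "hot": Fraction(1, 10)},
    "hot": {"cold": Fraction(4, 10), "hot": Fraction(6, 10)},
}

# Joint probabilities Pr(A, B) = Pr(B | A) Pr(A), keyed by (state of A, state of B)
joint = {(a, b): prob_B_given_A[a][b] * prob_A[a] for a in states for b in states}

# Pr(B): sum the joint probabilities over the states of A
prob_B = {b: sum(joint[(a, b)] for a in states) for b in states}

# Bayes' formula: Pr(A | B) = Pr(A, B) / Pr(B)
prob_A_given_B = {(a, b): joint[(a, b)] / prob_B[b] for a in states for b in states}

for (a, b), p in prob_A_given_B.items():
    print("Pr(A is", a, "| B is", b, ") =", p)
```

Fractions are reduced automatically, so 54/70 prints as 27/35 and 24/30 as 4/5.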


12.10 Marginal Distributions 


Summing the joint probabilities Pr(A, B) over all the possible B outcomes yields 
the marginal probability distribution of A. Thus, the probability of A being cold is 
the joint probability of both A and B being cold, 0.54, plus the joint probability of 
only B being hot, 0.06, which sums to 0.60. The probability of A being hot is the 
joint probability both A and B being hot, 0.24, plus the joint probability of A being 
hot and B being cold, 0.16, which sums to 0.40. Both sum to 1, as they should 
since A can only be cold or hot. 
The probabilities of the different states of B are found similarly.


Pr(B is cold) = Pr(B is cold and A is cold) + Pr(B is cold and A is hot)
= 0.54 + 0.16 = 0.7


Pr(B is hot) = Pr(B is hot and A is cold) + Pr(B is hot and A is hot)
= 0.06 + 0.24 = 0.3


The general equation for finding single or multiple variable marginal
distributions from joint probability distributions is by summing joint probabilities
over all the values of the other variables.


$$\Pr(Y) = \sum_X \Pr(X, Y)$$


For example, let X and Y be two random variables denoting the outcome of
two coin tosses. Their joint probability is Pr(X, Y). Since they are independent,
Pr(X, Y) = Pr(X)Pr(Y) = (0.5)(0.5) = 0.25 for each combination of X and Y. Using
these joint probabilities, the probability of X being Heads or Tails, Pr(X), is Pr(X,
Y = Heads) + Pr(X, Y = Tails) = 0.25 + 0.25 = 0.5. The probability of any Y outcome,
Pr(Y), is found similarly, by summing the joint probabilities over all
outcomes of X.

The same procedure applies to continuous random variables. For two continuous
random variables X and Y, the marginal density of X, from which the probability
of X being within a specified range of x values is obtained, is
$f_X(x) = \int_y f_{XY}(x, y)\,dy$.


12.11 Pedestrian Safety 


Suppose that the probability of a person (or duck as in Fig. 12.7) being hit by a 
vehicle while crossing the road at a pedestrian crossing is to be computed. Let 
H be the discrete random variable that has two possible outcomes, ‘hit’ and ‘not 
hit.’ Let L be a discrete random variable taking on three possible crosswalk light 
values: red, yellow, and green. 

Realistically, the probability of being hit when on the crosswalk will be depen- 
dent on the value of L. That is, Pr(H = Hit) and Pr(H = Not Hit) will take different 
values depending on whether L is red, yellow, or green. A person is, for example, 
far more likely to be hit by a vehicle when trying to cross when the crosswalk light 
is red instead of green. For any given possible pair of values for H and L, one must 
consider the joint probability distribution of H and L to find the probability of any 
pair of events H and L occurring together. 

Of interest is the probability Pr(H = hit) when we do not know the value of
L. In general, a pedestrian can be hit if the light is red or yellow or green but the
probabilities of being hit will differ. In this case, the answer for the probability of
H can be found by summing Pr(H|L) over all possible values of L, with each value
of Pr(H|L) weighted by the probability of that value of L occurring.


Fig. 12.7 Is it safe to cross the road? Robert McCloskey, Make Way for Ducklings, The
Viking Press (1941). https://en.wikipedia.org/wiki/Make_Way_for_Ducklings
Creative Commons CC0 License


Table 12.1 Conditional probability distributions, Pr(H|L), determined from data

Light L         Red     Yellow   Green
H = Hit         0.99    0.90     0.20
H = Not Hit     0.01    0.10     0.80


Table 12.2 Joint probabilities of H and L

                Joint distribution Pr(H, L)
Light L         Red     Yellow   Green    Marginal distribution Pr(H)
H = Hit         0.198   0.09     0.14     0.428
H = Not Hit     0.002   0.01     0.56     0.572
Total           0.2     0.1      0.7      1


Table 12.1 shows the conditional probabilities of being hit, depending on the
state of the lights. (Note that the columns in this table must add up to 1 because
the probability of being hit or not hit is 1 regardless of the state of the light.)

To find the joint probability distribution, we need to know what fraction of the
times the light shows each color. These fractions can be considered to be their
probabilities, Pr(L).

Assume that Pr(L = red) = 0.2, Pr(L = yellow) = 0.1, and Pr(L = green) =
0.7. Multiplying each conditional probability in each column by the probability of
that light occurring defines the joint probability distribution of H and L. These
are given in the central 2 x 3 block of entries in Table 12.2. (Note that the cells in
this 2 x 3 block add up to 1. The totals of the columns are the probabilities of the
different values of L.)

This analysis shows that if a pedestrian (or duck) pays no attention to the
crosswalk light, the probability of being hit is


Pr{H = Hit} = 0.428


and the probability of making it across the road safely is


Pr{H = Not Hit} = 0.572.


If the probability of being hit seems high, then perhaps one should pay attention
to the crosswalk light before crossing the street. If crossing occurs only on the
green, the probability of being hit is 0.20, as shown in Table 12.1.
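The calculation behind Table 12.2 can be sketched in a few lines of Python. The dictionary names are illustrative. The green-light value of 0.20 used here is the conditional probability implied by the joint entries of Table 12.2 (0.14/0.7); it is stated as an assumption of this sketch.

```python
# Probability of each light color, Pr(L)
light_prob = {"red": 0.2, "yellow": 0.1, "green": 0.7}

# Conditional probability of being hit given the light, Pr(H = Hit | L)
# (green value assumed to be the one consistent with Table 12.2)
hit_given_light = {"red": 0.99, "yellow": 0.90, "green": 0.20}

# Joint probabilities Pr(H, L) = Pr(H | L) Pr(L)
joint_hit = {c: hit_given_light[c] * light_prob[c] for c in light_prob}
joint_not_hit = {c: (1.0 - hit_given_light[c]) * light_prob[c] for c in light_prob}

# Marginal distribution of H: sum the joint probabilities over all light colors
prob_hit = sum(joint_hit.values())
prob_not_hit = sum(joint_not_hit.values())

print("joint Pr(Hit, L):", {c: round(p, 3) for c, p in joint_hit.items()})
print("Pr(Hit) =", round(prob_hit, 3), " Pr(Not Hit) =", round(prob_not_hit, 3))
```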
12.12 Sources of Uncertainty 


What makes a variable random and unpredictable? Three major causes are defined
in Fig. 12.8.

Fig. 12.8 Various causes of uncertainty

Often there is little one can do to reduce all the uncertainty in the data available
to analysts, which is why so many policy models must explicitly deal with the
existing uncertainties. The ways in which this can be done are what this chapter has
attempted to introduce.


Exercises

1. Security

You have a job that requires you to be protected some of the time. The probability
that the needed hours of protection, P, will be less than p is 0.2p - 0.01p^2. The cost of
protection is $50 each hour. What is the expected daily cost for your protection?

2. Probability of being flooded

A flood magnitude expected to be equaled or exceeded once in n years on average is
called the n-year flood. What is the probability of observing at least one 100-year
flood or greater over a 30-year period, assuming annual floods (maximum flow that
occurs in a year) are independent events?

3. State Lottery

You can buy lottery tickets from the State for $1 each. Each ticket has a 3-digit
number; each number is equally likely. Owners of winning tickets receive $500 for
each winning ticket.

Suppose you buy 1 ticket each week of an entire year, i.e., 52 tickets.


(a) Show how to calculate the probability that you will win at least one lottery in
the year. (The answer is 0.0507.)

(b) If the lottery sells 1,000,000 tickets this week, what is the expected income to
the State? Note: The expected income of 1 million tickets is the expected income
from one ticket times 1 million.

(c) Show how to calculate the variance of this income.


4. Book sale 


Twice a year a town has a used book sale, and at the end of the sale, they offer any 
book they have for $1. The cost of handling books is estimated to be about $0.65 per 
book. 

Past sales indicate that the probabilities of various ranges of books being
demanded are as follows:


Hundreds of books   Probability of demand   Probability of exceedance   Average Pr(exceedance)
0-2                 0                       1                           1
2-4                 0.1                     1-0.9                       0.95
4-6                 0.4                     0.9-0.5                     0.7
6-8                 0.4                     0.5-0.1                     0.3
8-10                0.1                     0.1-0                       0.05
10-12               0                       0                           0

How many books should they have available to maximize their expected net 
revenue from the sale? 


5. Bake sale 


The mayor of a town is considering having a $100-a-plate dinner to increase 
the funds available for the homeless. Her problem is that she doesn’t know how many 
people might come. Experience suggests that it largely depends on whether it rains 
or not. The local weather service has indicated that the probability of a dry day is 
0.70. 

Invitations must be sent out two weeks in advance of the dinner. 

If it doesn’t rain, there is an 80% chance that 500 people will attend and a 20%
chance that only 300 will attend (just to make it simple). If it rains, there is a 60%
chance that 350 will attend and a 40% chance that only 200 will attend. Each dinner
ordered in advance costs $20. Everyone that comes must be served dinner. Each
additional dinner ordered because of a shortage costs $30.


(a) How many dinners should the mayor order in advance of knowing how many 
will attend the dinner? 
(b) What is the maximum amount the mayor would be willing to pay for a weather 
forecaster that could predict for certain whether or not rain would occur on a 
particular day? The date of the dinner could then be set after such a forecast is 
made. 


6. Finding means, variances, medians 


For the following probability density functions, fx(x), of a random variable X, inte- 
grate them to find the equations for the cumulative distribution functions, F(x), 
(ranging from 0 to 1), and the median, mean, and variance of each of the distri- 
butions. Finally, compute the area under the probability of exceedance function, 1 — 
F(x). It should equal the mean. 


(Figure: plots of four probability density functions, each with vertical axis f_X(x))


7. Swimming 


Assume that admission to a public outdoor swimming pool in an urban area costs 
$5 per person. Also, assume the probability distribution of tickets sold per hour is 
uniform from 5 to 15, (as shown above in Exercise 6). Find the expected revenue per 
hour (You should be able to guess at the expected number of people buying tickets 
and that times $5 will be the expected revenue). 


8. Planning a Park 


A recreational park is being planned. It borders a lake. Planners need to decide 
at what lake level to build the recreational facilities such as docks, boat landings, 
picnic facilities, restrooms, etc. The potential benefits derived from locating all these 
facilities at higher lake level elevations increase due to the increasing shore-line 
perimeter (length) and flatter areas to develop. 


The developers assume the marginal benefits obtained will equal $5 per unit target 
elevation level if the actual lake is at that target level. But the lake level varies over 
the recreational season. No matter what target level is chosen for development, the 
actual lake level will likely differ. The developers estimate there will be a loss of 
$7.5 per unit deficit (difference between target level and lower actual lake level) or 
a loss of $1 per unit excess if the lake level is above the target level. 

For example, if the target level is 5, but the actual level is 4, the net income will
be $5(5) - (5 - 4)7.5 = 17.5. If the actual level is 6, the net income will be
$5(5) - 1(6 - 5) = 24.

Assume for simplicity the probability distribution of lake levels during the
recreational season varies over a range of 0 to 10 units uniformly. What target level
within that range from 0 to 10 will maximize expected net income?

Discuss a modeling approach you would use to find the best value of the target 
level, and demonstrate its use. 


9. Birthday problem 


What is the probability P of at least two in a group of n people having the same 
birthday (month and day)? Write the expression for P. 


10. Heart Attacks 


Serious heart attacks occur in a county on an average of once every two weeks but 
are random. 


(a) How many heart attacks should the physicians expect to respond to in a single 
year, on average? 
(b) What is the probability that at least two heart attacks will occur on the same day? 


11. Taxicab problem 


Three taxi stands are serviced by taxi companies: Sites A, B, and C. 
Three policies have been tested but not analyzed: 
Policy 1: cruise around the site and pick up first person wanting a ride. 
Policy 2: return to the nearest taxi stand and wait for the rider. 
Policy 3: wait at the nearest site for a radio call (not available at B). 


Questions: 


• What is the best policy at each site?
• Given the best policy, what is the probability of being at each site?
• Given the best policy, what is the expected net income from each rider picked up
  at each site?
• What is the overall expected net income per rider?


To answer the questions, you will need data. 


Data
Average costs, C(ik), of policy k at site i and resulting trip count:


Site i  Policy k  C(ik)   No. of trips to site j       Probabilities P(ijk) = P(j|ik)
                          A     B     C     Total      A      B      C
A       2         5       4     48    12    64         1/16   0.75   3/16
B       1         1       45    0     45    90         0.5    0      0.5
B       2         6       5     70    5     80         1/16   7/8    1/16
C       1         2       15    15    30    60         0.25   0.25   0.5
C       2         4       8     48    8     64         1/8    0.75   1/8
C       3         5       36    3     9     48         0.75   1/16   3/16

Site i  Site j  TC(ij)
A       A       1
A       B       4
A       C       7
B       A       4
B       B       2
B       C       5
C       A       7
C       B       5
C       C       2

Average income Y(ijk), costs C(ik), and net income R(ijk), from site i, policy
k, and destination j.

Site i  Policy k  Site j  Y(ijk)  TC(ij)  C(ik)  R(ijk)
A       1         A       14      1       3      10
A       2         A       14      1       5      8
A       3         A       14      1       9      4
A       1         B       11      4       3      4
A       2         B       11      4       5      2
A       3         B       19      4       9      6
A       1         C       18      7       3      8
A       2         C       16      7       5      4
A       3         C       20      7       9      4
B       1         A       19      4       1      14
B       2         A       18      4       6      8
B       1         B       3       2       1      0
B       2         B       24      2       6      16
B       1         C       24      5       1      18
B       2         C       19      5       6      8
C       1         A       19      7       2      10
C       2         A       17      7       4      6
C       3         A       16      7       5      4
C       1         B       9       5       2      2
C       2         B       13      5       4      4
C       3         B       10      5       5      0
C       1         C       12      2       2      8
C       2         C       8       2       4      2
C       3         C       15      2       5      8


12. Public Library 


A town’s public library needs more space. Recently, the town had to decide 
whether to relocate or renovate its public library. The old, and now empty, Wool- 
worth Store was a potential new location. A Foundation indicated they would give 
the town $2.5 million if they immediately chose the Woolworth Store. This gift 
would help pay the estimated relocation cost of $9.5 million. It was not clear that 
the Foundation would give the $2.5 million to the town if the town chose to reno- 
vate the existing library or to delay the relocation decision to first determine if the 
Woolworth Store could be rented. 

The debate over what to do centered on the question of whether the Woolworth 
Store could be rented, and hence generate tax revenue for the town. If the library 
were moved to the old store, there would be no tax revenue derived from that 
store but there would be some income derived from the sale of the existing library 
building—if they could sell it. 

Assume that when the Foundation made the offer, you were asked to help the 
town decide what to do. 

You reason the town has some choices: It could decide to move its public 
library to the old Woolworth Store, or it could hire a consultant to evaluate the 
suitability of that store for another business and to obtain a better estimate of the 
likely income from the sale of the existing library building. If the town decides to 
move the library, the Woolworth relocation cost would be $7 million ($9.5 million 
less the Foundation gift of $2.5 million) and take two years. If the town hires a 
consultant, the consultant will charge the town $100,000 and require 6 months to 
make a recommendation. The benefits of a relocated or renovated library would 
be delayed by the additional 6 months required by the consultant. 

If the consultant is hired and indicates the old Woolworth Store has no commer- 
cial value, then the relocation process could take place immediately, at a cost of $7 
million or $9.5 million, depending on whether the Foundation gives the town $2.5 
million, less the expected income from the sale of the existing library building. On 
the other hand, if the consultant indicates the old store has commercial value, the 
town could act immediately to renovate the existing library, or it could wait and 
try to rent the store over the coming year. If, after a year, the store is not rented, 
the town would relocate the library. The relocation costs and time remain the same 
as before: $7 million or $9.5 million over two years, depending on whether the 
Foundation gives the town $2.5 million, less the expected income from the sale 
of the existing library. In addition, the benefits of having a new facility are 
further delayed by the waiting period, say a year. 

Renovation of the existing library will take 2 years and cost $13.5 million 
or $11 million, again depending on the Foundation’s $2.5 million gift decision, 
less the expected capitalized tax revenues from the rental of the Woolworth Store 
(considering the possibility that it might not be rented). 

If the town waits to see if it can rent the store, and succeeds in renting the store, 
say in a year, then it can begin the renovation of the existing library, again at a 
cost of $13.5 million or $11 million, depending on the Foundation’s $2.5 million 
gift decision, plus the lost benefits to the library users of delaying another year, 
less the capitalized (present value of the) tax revenues from renting the Woolworth 
Store. 

Show how you would determine how to advise the town. Should the town 
relocate its library now or hire a consultant? What are your decision criteria? 
What probabilities do you need to estimate to answer this question? What other 
assumptions do you have to make? How would you determine how sensitive your 
recommendation is to all those assumptions? 


13. Immigrants 


Suppose you are a designer of a facility to temporarily house immigrants entering 
the country. The number of immigrants needing housing in the facility each week 
varies. Data exist that allow you to calculate the probability distribution of the 
number of people needing housing each week. Let P represent the discrete random 
variable for the number of people needing housing, and Pr(p) be the probability 
that P = p. The sum over all p of Pr(p) equals 1. 

Your job is to determine the target population level of your new facility, realiz- 
ing that you may have more or less than that target level each week. Those running 
the facility will get paid a certain amount based on both the target capacity of the 
facility and the actual average number in the facility each week. 
The revenue obtained from housing a number of people equal to the target population,
T, is defined by the concave function R(T) shown below. Note that if T were 20
and 20 people were housed, the benefits would equal -5 + 16(13) + 8(7) = 259. The -5
reflects fixed costs if the facility is built. If it is not built, T = 0 and R(T) = 0.


[Figure: concave revenue function R(T) plotted against the target population T]


If the number of people in the facility is not equal to the target value T, there 
is a reduction in total net revenue. For each person less than the target, there is a 
loss of $21. For each person in excess of the target, there is a loss of $3. 

The loss function is shown below. Note: Losses are a function of the deviations 
from the target population T and are assumed independent of the value of the 
target number, T. 


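[Figure: loss plotted against the population housed, with zero loss at the target T,
a loss of $21 per person below T, and a loss of $3 per person above T]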


For example, suppose the target T is 20 and the actual number housed is 15. The total
net benefits would equal R(T) - 21(20 - 15) = -5 + 16(13) + 8(7) - 21(5) = 154.

Develop a linear model that will find the value of the target number T that 
maximizes the expected total net revenue. Note: Total expected net revenue = 
revenue obtained from the target T less expected losses from deviations from target 
associated with each value p of P and its probability Pr(p). Show the model needed 
to determine the target T that maximizes total expected revenue. 


14. Licenses 


The State allocates hunting licenses to a store that sells them for $100 each. The 
demand for licenses is uniformly distributed between 10 and 30. At least 10 will 
be demanded and at most 30 will be demanded at that store. 


(a) Define the expected income function associated with any allocation ‘x’ of 
hunting licenses. Sketch the function. 
(b) Assume there are two stores, but the demand distribution at the other store is
uniform between 5 and 15. If only 25 licenses are to be allocated, how many
licenses should be allocated to each store to maximize the total expected
income?
ABSTRACT


Many public systems must deal with uncertain inputs over time. This chapter
illustrates how models incorporating such inputs can be developed and solved.
Stochastic linear and dynamic programming models are developed and compared
to show how each defines optimal sequential, conditional decision-making
strategies.


13.1 Introduction 


A stochastic process refers to a system whose outputs are random over time. The 
sequence of newly infected people with a particular disease in a city, the sequences 
of coin tosses, the daily flows in the Danube River at Vienna, or the number of cus- 
tomers seeking driver license renewals at a local motor vehicle office each weekday 
are all examples of stochastic processes. While we cannot predict the outcome of 
any stochastic process precisely, we may be able to predict the probabilities of 
various outcomes of systems as influenced by any decisions made affecting their 
operation. 

The examples presented in this chapter will be limited to simple first-order
discrete stochastic processes. These are defined by conditional probabilities of being
in some state $S_{t+1}$ in period t+1 given the state $S_t$ in period t. We cannot predict
what future states may be, but we assume we can predict the probabilities of being
in various future states based on the current state. These predictions, expressed
as conditional probabilities, $\Pr(S_{t+1} \mid S_t)$, may be based on historical time series
data whose statistical characteristics may apply in the future as well. What is also
implied by using conditional probabilities is that the probability of some state
value $S_{t+1}$ in period t+1 is dependent only on the actual value of the state $S_t$ in the
previous period t and not on earlier state values. Hence, the use of the term
‘first-order’. The validity of such an assumption may largely depend on the duration of
the time periods being modeled.
Fig. 13.1 An example of a stochastic process involving uncertain outcomes over time. Public
Domain. File:DJIA 2000s graph (log).svg,
https://en.wikipedia.org/wiki/Dow_Jones_Industrial_Average#/media/File:DJIA_2000s_graph_(log).svg


13.2 Changing Weather 


For example, consider two types of weather, good, G, and bad, B. Based on the
following sequence of 20 days of observations, GGGBBBBGGGBBBGGGBBBB, a
matrix of conditional probabilities can be created. The rows of this matrix
represent the possible values of the weather in day t, $S_t$, and the columns represent
the possible values of the weather in the next day t + 1, $S_{t+1}$ (Fig. 13.2). Out
of 19 transitions from one state to another in this time series, 6 were from Good
to Good and 3 were from Good to Bad, for a total of 9 transitions from Good.
From the state of Bad, 2 became Good the next day, and 8 remained in a Bad


Fig. 13.2 Good weather days and bad weather days. They happen and are only temporary.
Public domain. https://i.pinimg.com/originals/e3/0c/21/e30c2162f96bf54a059876d092906358.jpg


            S(t+1):   G       B      Sum
S(t):  G            0.667   0.333     1
       B            0.2     0.8       1


Fig. 13.3 The matrix of conditional or transition probabilities above resulting from the recorded
time series of good and bad days. It is called a first-order Markov chain whose rows sum to 1


state. Dividing each number of transitions from a Good state by the total number
of transitions from Good, and doing the same for transitions from a Bad state, defines
the conditional probabilities that must sum to 1 on each row of the matrix. These
conditional probabilities are also called transition probabilities: the probability of
making a transition from one state in period t to another state in the next period,
t + 1.
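
The counting procedure just described can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the original text: the function name `estimate_transition_probabilities` and the dictionary layout are assumptions made for the example, and the code assumes every state in the record is followed by at least one observation.

```python
from collections import Counter

# Observed 20-day weather record from the text (G = good, B = bad).
observations = "GGGBBBBGGGBBBGGGBBBB"


def estimate_transition_probabilities(sequence):
    """Return {state: {next_state: probability}} estimated from one sequence.

    Assumes each state occurs at least once before the final observation,
    so that every row total is positive.
    """
    # Count each observed (today, tomorrow) pair of consecutive days.
    counts = Counter(zip(sequence, sequence[1:]))
    states = sorted(set(sequence))
    matrix = {}
    for today in states:
        row_total = sum(counts[(today, tomorrow)] for tomorrow in states)
        matrix[today] = {
            tomorrow: counts[(today, tomorrow)] / row_total for tomorrow in states
        }
    return matrix


transition = estimate_transition_probabilities(observations)
for today, row in transition.items():
    print(today, {state: round(p, 3) for state, p in row.items()})
```

Each printed row should sum to 1 and match the corresponding row of Fig. 13.3.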

Using these conditional probabilities, shown in Fig. 13.3, one can compute the 
probabilities of having a good or bad day in successive days t+1, t+2, t+3... given 
the current state of the weather in day t. 


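$$\Pr(G \text{ in } t+1) = \Pr(G \text{ in } t)\,\Pr(G \text{ in } t+1 \mid G \text{ in } t) + \Pr(B \text{ in } t)\,\Pr(G \text{ in } t+1 \mid B \text{ in } t), \quad t = 1, 2, 3, 4, \ldots$$


$$\Pr(B \text{ in } t+1) = \Pr(G \text{ in } t)\,\Pr(B \text{ in } t+1 \mid G \text{ in } t) + \Pr(B \text{ in } t)\,\Pr(B \text{ in } t+1 \mid B \text{ in } t), \quad t = 1, 2, 3, 4, \ldots$$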


Eventually, the predicted probabilities will not change significantly from one 
day to the next, as one would expect. The probability of the state of weather a 
month from now is not likely to be influenced by the weather today. 
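
As a rough illustration of how these day-to-day probabilities settle down, the following Python sketch applies the two equations repeatedly using the transition probabilities of Fig. 13.3. The starting condition (a good day on day 1) and the 30-day horizon are assumptions chosen only for the example.

```python
# Transition probabilities from Fig. 13.3: outer key is today's state,
# inner key is tomorrow's state.
TP = {"G": {"G": 6 / 9, "B": 3 / 9},
      "B": {"G": 0.2, "B": 0.8}}


def next_day(prob):
    """One step of Pr(s in t+1) = sum over r of Pr(r in t) * Pr(s in t+1 | r in t)."""
    return {s: sum(prob[r] * TP[r][s] for r in TP) for s in TP}


def forecast(initial, days):
    """Return the state probabilities for day 1, 2, ..., days + 1."""
    history = [initial]
    for _ in range(days):
        history.append(next_day(history[-1]))
    return history


# Hypothetical starting point: day 1 is known to be a good day.
path = forecast({"G": 1.0, "B": 0.0}, days=30)
for day in (1, 2, 3, 8, 31):
    p = path[day - 1]
    print(f"day {day:2d}: Pr(G) = {p['G']:.3f}, Pr(B) = {p['B']:.3f}")
```

Starting instead from a bad day changes the first few rows but not the values the sequence approaches, which for these transition probabilities are Pr(G) = 0.375 and Pr(B) = 0.625.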


13.3 The Stock Market 


For another example, consider successive states of the stock market. Assume the
stock market can be in one of three states: 1 = bear market, 2 = strong bull market,
and 3 = weak bull market. Historically, a certain mutual fund gained -3%, 28%, and
10% annually when the market was in states 1, 2, and 3, respectively. The state
transition matrix defining each $P(S_{y+1} \mid S_y)$ is shown in Fig. 13.4.

Referring to these conditional or transition probabilities, we can determine
what the probabilities of future states may be given the present state, as shown
in Fig. 13.5. Assume the present state, $S_1$, is 1.

The process shown in Fig. 13.5 continues until it converges to 0.333, 0.200, and 
0.467 for states 1, 2, and 3, respectively. These are termed steady-state values that 
do not change in subsequent periods. They are the unconditional probabilities of 
Fig. 13.4 Markov chain showing transition probabilities for three states of the stock market

                     Year y+1
          State:    1      2      3
Year y      1      0.90   0.02   0.08
            2      0.05   0.85   0.10
            3      0.05   0.05   0.90


Fig. 13.5 Probabilities of the state of the stock market for three successive years

Year y   State:    1       2       3
  1               1.0     0.0     0.0
  2               0.90    0.02    0.08
  3               0.815   0.039   0.146

0.815 = 0.9(0.9) + 0.02(0.05) + 0.08(0.05)
0.039 = 0.9(0.02) + 0.02(0.85) + 0.08(0.05)
0.146 = 0.9(0.08) + 0.02(0.1) + 0.08(0.9)


each state, and as one might guess, they are not influenced by the starting state in 
period 1. The state of this mutual fund 10 years from now will not likely depend 
on what it is now. These same steady-state values will result from any assumed 
state in year 1. 

These steady-state values can be computed directly using the same equa- 
tions used to compute successive probabilities as shown above but with unknown 
probabilities of each given state. 

Thus, for this example, solving at least two of the following three equations: 


Pr(S = 1) = Pr(S = 1)(0.90) + Pr(S = 2)(0.05) + Pr(S = 3)(0.05), 
Pr(S = 2) = Pr(S = 1)(0.02) + Pr(S = 2)(0.85) + Pr(S = 3)(0.05), 
Pr(S = 3) = Pr(S = 1)(0.08) + Pr(S = 2)(0.10) + Pr(S = 3)(0.90), 


together with the equation expressing the fact that Pr(S=1) + Pr(S=2) + Pr(S=3)
= 1 will determine the steady-state values of each Pr(S), namely 0.333, 0.200, and
0.467 for S = 1, 2, and 3, respectively.
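
The steady-state equations above form a small linear system, and one way to check the quoted values is to solve it directly. The sketch below is an illustration only: it uses a plain Gauss-Jordan elimination written out by hand (the helper names `solve_linear_system` and `steady_state` are assumptions for this example), keeps all but one of the balance equations, and replaces the redundant one with the requirement that the probabilities sum to 1.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b] so that row operations act on both sides.
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def steady_state(tp):
    """Steady-state probabilities for a transition matrix tp[i][j] = Pr(j | i)."""
    n = len(tp)
    # Balance equations for the first n-1 states:
    #   sum_i Pr(i) * tp[i][j] - Pr(j) = 0
    a = [[tp[i][j] - (1.0 if i == j else 0.0) for i in range(n)]
         for j in range(n - 1)]
    b = [0.0] * (n - 1)
    # The last balance equation is redundant; use sum of Pr(i) = 1 instead.
    a.append([1.0] * n)
    b.append(1.0)
    return solve_linear_system(a, b)


# Fig. 13.4: states 1 (bear), 2 (strong bull), 3 (weak bull).
stock_tp = [[0.90, 0.02, 0.08],
            [0.05, 0.85, 0.10],
            [0.05, 0.05, 0.90]]

stock_steady = steady_state(stock_tp)
print([round(p, 3) for p in stock_steady])
```

The printed values can be compared with the 0.333, 0.200, and 0.467 given in the text.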

In general, for any Markov chain having rows i and columns j with transition
probabilities $TP(S_j \mid S_i)$,


$$\Pr(S_j) = \sum_i \Pr(S_i)\, TP(S_j \mid S_i) \quad \forall j$$


$$\sum_i \Pr(S_i) = 1.$$
Using the unconditional steady-state probabilities, Pr(Sj) (such as those found by
solving the above equations), the expected annual yield is


-3(0.333) + 28(0.2) + 10(0.467) = 9.3%/year.


The expected yield, i10, over 10 years = (1.093)^10 - 1 = 2.4333 - 1 = 1.4333, or 143%.
Hence, investing $1 in this mutual fund, one can expect to have $2.43 in 10 years.
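
The arithmetic above can be reproduced with a short Python sketch. It simply mirrors the calculation in the text, which compounds the rounded expected annual yield over 10 years; the variable names are assumptions made for the example.

```python
# Steady-state probabilities and annual gains (%) for states 1, 2, 3, from the text.
steady = [0.333, 0.200, 0.467]
annual_gain_pct = [-3.0, 28.0, 10.0]

# Expected annual yield: probability-weighted average of the three gains.
expected_gain_pct = sum(p * g for p, g in zip(steady, annual_gain_pct))

# As in the text, round to 9.3% and compound over 10 years.
growth_10yr = (1 + round(expected_gain_pct, 1) / 100) ** 10

print(f"expected annual yield: {expected_gain_pct:.2f}%")
print(f"value of $1 after 10 years: ${growth_10yr:.2f}")
print(f"10-year yield: {100 * (growth_10yr - 1):.0f}%")
```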


13.4 Human Health 


The state of one’s health is also a stochastic process. Consider for this example 
four discrete states of health. Using data from the public health department, the 
following Markov chain shows the conditional probabilities of an average person 
being in any state of health given a previous state (Fig. 13.6). 

We can use Excel, for example, to find the progression of state probabilities 
from some assumed initial state, solving successive equations: 


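$$\Pr(S_j)_{t+1} = \sum_i \Pr(S_i)_t\, TP(S_j \mid S_i) \quad \forall j, \quad t = 2, 3, 4, \ldots$$

Alternatively, we can find the steady-state probabilities of being in any state of
health by solving


$$\Pr(S_j) = \sum_i \Pr(S_i)\, TP(S_j \mid S_i) \quad \forall j$$

$$\sum_j \Pr(S_j) = 1$$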


directly for the steady-state probabilities Pr(Sj) for each Sj. 

These steady-state probabilities are shown in Table 13.1. 

Next consider another state of health: death. Assume the Markov chain defining 
the transition probabilities for states of health is as shown in Fig. 13.7. 

Solving the same set of equations as shown above defines the steady-state prob- 
abilities for these five states of health. They are as expected. They all are 0, except 
death. Its steady-state probability is 1. Such is life (or rather death). In the long 
run, we all are certain to die. Once dead we cannot transition to another state of 
health (as far as we know). Mathematicians call this a trapping state. Once in it, 
you cannot get out. 
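
To see the trapping state at work, the following Python sketch repeatedly applies the transition probabilities of Fig. 13.7. It is an illustration under stated assumptions: the person is taken to be well in the first period, and the numbers of periods printed are arbitrary choices.

```python
STATES = ["well", "cold", "flu", "serious", "death"]

# Transition probabilities from Fig. 13.7 (row = state in period t,
# column = state in period t+1, both in the order of STATES).
TP = [
    [0.70, 0.25, 0.03, 0.01, 0.01],  # well
    [0.60, 0.20, 0.13, 0.05, 0.02],  # cold
    [0.20, 0.30, 0.40, 0.06, 0.04],  # flu
    [0.05, 0.15, 0.20, 0.50, 0.10],  # serious
    [0.00, 0.00, 0.00, 0.00, 1.00],  # death (trapping state)
]


def step(prob):
    """One period: Pr(S_j) at t+1 = sum over i of Pr(S_i) at t times TP[i][j]."""
    n = len(prob)
    return [sum(prob[i] * TP[i][j] for i in range(n)) for j in range(n)]


def state_probabilities(initial, periods):
    """State probabilities after the given number of periods."""
    prob = list(initial)
    for _ in range(periods):
        prob = step(prob)
    return prob


# Hypothetical initial condition: the person is well in the first period.
start = [1.0, 0.0, 0.0, 0.0, 0.0]
for periods in (1, 10, 100, 2000):
    prob = state_probabilities(start, periods)
    print(periods, {s: round(p, 4) for s, p in zip(STATES, prob)})
```

The probability of the death state never decreases from one period to the next, and the other four probabilities shrink toward 0 as the number of periods grows.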


Fig. 13.6 Transition probabilities for states of health (well, cold, flu, serious) from one
period t to the next period t+1
Table 13.1 Steady-state probabilities of various states of health

Variable      Value
Pr(well)      0.5671927
Pr(cold)      0.2368245
Pr(flu)       0.1217599
Pr(serious)   0.0742229
Fig. 13.7 Transition probabilities for successive states of health

                               Period t+1
            State:    well   cold   flu    serious   death
Period t    well      0.70   0.25   0.03   0.01      0.01
            cold      0.60   0.20   0.13   0.05      0.02
            flu       0.20   0.30   0.40   0.06      0.04
            serious   0.05   0.15   0.20   0.50      0.10
            death     0      0      0      0         1


13.5 Reducing Crime 


This is an example of building stochastic linear and dynamic programming 
optimization models incorporating transition probabilities. 

A community center provides recreation facilities for people. The impact on 
the community is lower crime rates. Assume, again for simplicity, there are two 
states of crime rates—low (L) and high (H). Observed crime rates over time show 
that if the crime rate is low in any month, the probability of having a low rate 
the following month is 0.7. The probability of having a high crime rate month 
following a low crime rate month is 0.3. If the crime rate is high in a month, 
the probability of a high crime rate the following month is 0.6, and thus, the 
probability of a low crime rate is 0.4. These probabilities apply if the community
center does not advertise its services and facilities. This is the do-nothing policy
(Policy n). These conditional probabilities are shown on the left of Fig. 13.8.

However, if the center advertises its recreation programs (Policy a), the 
conditional probabilities change to those shown on the right of Fig. 13.8. 


Fig. 13.8 Transition probabilities for low and high crime rates associated with two policies,
‘n’ (do-nothing) and ‘a’ (advertise)

Policy n:        Month t+1          Policy a:        Month t+1
                  L      H                            L      H
Month t    L     0.7    0.3         Month t    L     0.8    0.2
           H     0.4    0.6                    H     0.6    0.4
There are costs involved in advertising as well as additional costs associated
with high crime rates. These costs, denoted as C(j,k) associated with crime rate j
and policy k, are listed in Table 13.2.

The objective is to find the policy associated with each state that minimizes the
expected value of the monthly total cost. Letting the unknown joint probability of
any combination of crime rates i followed by j, and policy k, be Pr(i,j,k), then the
objective can be written as the sum over all values of i, j, and k, of the associated
costs, C(j,k), times their joint probabilities, Pr(i,j,k):


$$\text{Minimize } \sum_i \sum_j \sum_k C(j,k)\, \Pr(i,j,k).$$


To determine the steady-state values of each joint probability Pr(i,j,k), we can 
first define the marginal probabilities Pr(j,k) by summing the joint probabilities 
Pr(i,j,k) over all initial crime rates i. 


$$\Pr(j,k) = \sum_i \Pr(i,j,k) \quad \forall j, k.$$


Each joint probability Pr(i,j,k) equals Pr(i,k) at time t times the known transition 
probability, TP(i,j,k), of state j at time t+1 given state i in period t and policy k. 


$$\Pr(i,j,k) = \Pr(i,k)\, TP(i,j,k) \quad \forall i, j, k.$$

Combining these two equations


$$\Pr(j,k) = \sum_i \Pr(i,k)\, TP(i,j,k) \quad \forall j, k$$


and together with


$$\Pr(i) = \sum_k \Pr(i,k) \quad \forall i$$


defining the steady-state probabilities of each crime state and $\sum_i \Pr(i) = 1$.


Table 13.2 Costs associated with the crime rate and policy

Cost C(j,k)        Policy n   Policy a
Crime rate L       0          5
Crime rate H       20         25
This defines a linear optimization model that when solved will give us the
optimal policy k depending on the state of crime as well as the minimum monthly
expected total cost.

For each state i, the policy k whose joint probability Pr(i,k) (either Pr(i,n) or
Pr(i,a)) is non-zero will be the best policy. Its conditional probability, Pr(k|i), will
equal 1. Otherwise, it will equal 0 unless it doesn’t matter what policy is chosen.


Pr(k|i) = Pr(i, k)/Pr(i).


The solution of this model is 
Objective value: Minimum monthly expected cost = 8.33. 


Pr(L) = 0.667 = steady-state probability of low crime rate if optimal policy 
followed. 


Pr(H) = 0.333 = steady-state probability of high crime rate if optimal policy 
followed. 


Pr(L, n) = 0.667 implies that if in state L, do not advertise. 
Pr(L, a) = 0.0 implies that if in state L, do not advertise. 
Pr(H, n) = 0.0 implies that if in state H, advertise. 


Pr(H, a) = 0.333 implies if in state H, advertise. 

These values are derived from the values of the joint probabilities Pr(i,j,k) listed 
in Table 13.3. 

An alternative linear programming model based on Fig. 13.8 is perhaps more
straightforward. Let the joint probabilities Pr(state, policy), denoted here as PLn,
PLa, PHn, and PHa, be the indicators of the best policy given the state. Again, for
each state the one that is non-zero indicates the best policy. The probabilities of the
states, Pr(L) and Pr(H), denoted as PL and PH in the model below, result if the
optimal policy is followed.


Minimize PLn*(TPLLn*CL + TPLHn*CH) + PLa*(A + TPLLa*CL + TPLHa*CH) + 
PHn*(TPHLn*CL + TPHHn*CH) + PHa*(A + TPHLa*CL + TPHHa*CH). 


Table 13.3 Values of joint probabilities Pr(i,j,k)

Pr(L,L,n) 0.467
Pr(L,L,a) 0.000
Pr(L,H,n) 0.200
Pr(L,H,a) 0.000
Pr(H,L,n) 0.000
Pr(H,L,a) 0.200
Pr(H,H,n) 0.000
Pr(H,H,a) 0.133
PL = (PLn*TPLLn + PHn*TPHLn) + (PLa*TPLLa + PHa*TPHLa);
PH = (PLn*TPLHn + PHn*TPHHn) + (PLa*TPLHa + PHa*TPHHa);

PL + PH = 1;
PL = PLn + PLa;
PH = PHn + PHa.


Transition probabilities: 

TPLLn = 0.7; TPLLa = 0.8; 

TPLHn = 0.3; TPLHa = 0.2; 

TPHLn = 0.4; TPHLa = 0.6; 

TPHHn = 0.6; TPHHa = 0.4. 

Costs: CL = 0; CH = 20; advertising cost A = 5. 
The solution to this model is 

Objective value: 8.333333. 


Variable Value Reduced cost 
PLn 0.667 0.000 
PLa 0.000 2.222 
PHn 0.000 0.556 
PHa 0.333 0.000 
PL 0.667 0.000 
PH 0.333 0.000 
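
The reported solution can be cross-checked without a linear programming solver. The sketch below is not the LP itself: it is a brute-force check that enumerates the four deterministic policies (n or a in each of states L and H), computes the steady-state probabilities each one implies, and evaluates the same cost expression used in the objective. It relies on the assumption, consistent with the solution reported above, that the optimum is one of these four policies. The function name `evaluate` and the data layout are choices made for this example.

```python
from itertools import product

# Data from the text: TP[policy][state][next_state], crime costs CL and CH,
# and advertising cost A.
TP = {"n": {"L": {"L": 0.7, "H": 0.3}, "H": {"L": 0.4, "H": 0.6}},
      "a": {"L": {"L": 0.8, "H": 0.2}, "H": {"L": 0.6, "H": 0.4}}}
CRIME_COST = {"L": 0.0, "H": 20.0}
AD_COST = {"n": 0.0, "a": 5.0}


def evaluate(policy):
    """Steady-state probabilities and expected monthly cost of a fixed policy.

    policy maps each state to a decision, e.g. {"L": "n", "H": "a"}.
    """
    p_lh = TP[policy["L"]]["L"]["H"]   # Pr(H next month | L now)
    p_hl = TP[policy["H"]]["H"]["L"]   # Pr(L next month | H now)
    # Two-state balance: Pr(L) * p_lh = Pr(H) * p_hl, with Pr(L) + Pr(H) = 1.
    pr = {"L": p_hl / (p_lh + p_hl), "H": p_lh / (p_lh + p_hl)}
    cost = 0.0
    for s in ("L", "H"):
        k = policy[s]
        expected_crime_cost = sum(TP[k][s][j] * CRIME_COST[j] for j in ("L", "H"))
        cost += pr[s] * (AD_COST[k] + expected_crime_cost)
    return pr, cost


results = {}
for k_low, k_high in product("na", repeat=2):
    pr, cost = evaluate({"L": k_low, "H": k_high})
    results[(k_low, k_high)] = (pr, cost)
    print(f"L: {k_low}, H: {k_high}  Pr(L) = {pr['L']:.3f}  "
          f"Pr(H) = {pr['H']:.3f}  cost = {cost:.3f}")

best = min(results, key=lambda key: results[key][1])
print("lowest-cost policy (decision in L, decision in H):", best)
```

For larger problems, enumeration quickly becomes impractical, which is where the linear programming formulation earns its keep.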


The two models containing probabilities as unknown variables presented above 
are solved using linear programming. From the values of these probabilities, we 
can identify the best policy given any state of the system. One can also use stochas- 
tic dynamic programming to find the best advertising policy directly given the 
current crime state. Each stage of the network is as shown in Fig. 13.9. The net- 
work clearly shows that no matter what policy is chosen, the ending states remain 
random. 

Fig. 13.9 Network representation of each stage of the stochastic dynamic programming model
for crime reduction. From state L, policy n leads to L with probability 0.7 and to H with
probability 0.3; policy a (cost $5) leads to L with probability 0.8 and to H with probability
0.2. From state H, policy n leads to L with probability 0.4 and to H with probability 0.6;
policy a (cost $5) leads to L with probability 0.6 and to H with probability 0.4. Each
transition into state H costs $20.

Using dynamic programming, we need to compute the minimum expected cost
of all remaining months at each node or state for each successive remaining month
m. Let $F_m(S)$ represent that value for any state S (L or H) and the remaining
number of months m. Working backwards from right to left and beginning with
$F_0(S) = 0$,


$$F_1(L) = \min\{[0.7F_0(L) + 0.3(F_0(H) + 20)]_n,\ [5 + 0.8F_0(L) + 0.2(20 + F_0(H))]_a\} = \min(6, 9) = 6.$$


The best policy given state L with one month remaining is not to advertise.


$$F_1(H) = \min\{[0.4F_0(L) + 0.6(F_0(H) + 20)]_n,\ [5 + 0.6F_0(L) + 0.4(20 + F_0(H))]_a\} = \min(12, 13) = 12.$$


Again, the best policy given state H with one month remaining is not to 
advertise. 

Continuing backward, the general recursion equations for each successive 
remaining month m are: 


$$F_{m+1}(L) = \min\{[0.7F_m(L) + 0.3(F_m(H) + 20)]_n,\ [5 + 0.8F_m(L) + 0.2(20 + F_m(H))]_a\};$$


$$F_{m+1}(H) = \min\{[0.4F_m(L) + 0.6(F_m(H) + 20)]_n,\ [5 + 0.6F_m(L) + 0.4(20 + F_m(H))]_a\}.$$


The process can stop when the minimum cost policies k (decisions n or a)
remain the same for the same state in two successive months or when the
differences $F_{m+1}(S) - F_m(S)$ equal the same constant for both values of S. This constant
in this example will be the minimum monthly expected cost, 8.33.

The results from solving a succession of 10 recursive equations for each state
are given in Table 13.4. Instead of using subscripts for the remaining months m,
that value will be included in the function. For example, $F_m(S)$ is shown as F(S,m)
and F(S,m) = $\min_k F_m(S,k)$.
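
The backward recursion can be sketched in Python as follows. This is a minimal illustration rather than the software used for the results in the text; the names `backward_step` and `table`, and the choice of 10 remaining months, are assumptions made for the example. The cost of advertising is charged once per month, and the cost of a high-crime month is charged when the process moves into state H.

```python
# Transition probabilities TP[policy][state][next_state] and costs from the text.
TP = {"n": {"L": {"L": 0.7, "H": 0.3}, "H": {"L": 0.4, "H": 0.6}},
      "a": {"L": {"L": 0.8, "H": 0.2}, "H": {"L": 0.6, "H": 0.4}}}
CRIME_COST = {"L": 0.0, "H": 20.0}
AD_COST = {"n": 0.0, "a": 5.0}
STATES = ("L", "H")


def backward_step(f_prev):
    """Given F_m(S), return F_{m+1}(S) and the cost-minimizing policy per state."""
    f_next, best = {}, {}
    for s in STATES:
        # Expected cost of each policy: advertising cost plus the expected
        # cost of next month's state and of all remaining months after it.
        options = {
            k: AD_COST[k] + sum(TP[k][s][j] * (CRIME_COST[j] + f_prev[j])
                                for j in STATES)
            for k in ("n", "a")
        }
        best[s] = min(options, key=options.get)
        f_next[s] = options[best[s]]
    return f_next, best


f = {"L": 0.0, "H": 0.0}   # F_0(S) = 0
# Rows: (m, F(L,m), policy in L, F(H,m), policy in H, change in F(L), change in F(H)).
table = []
for m in range(1, 11):
    f_new, policy = backward_step(f)
    table.append((m, f_new["L"], policy["L"], f_new["H"], policy["H"],
                  f_new["L"] - f["L"], f_new["H"] - f["H"]))
    f = f_new

for m, fl, kl, fh, kh, dl, dh in table:
    print(f"m={m:2d}  F(L)={fl:7.3f} {kl}  F(H)={fh:7.3f} {kh}  "
          f"change L={dl:.3f}  change H={dh:.3f}")
```

The month-to-month changes in both states should settle to the same constant, and the policies should stop changing after the first few months.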

This expected monthly cost of 8.33 can be compared to the monthly expected 
cost if one decided not to advertise. The difference of the two expected cost values 
would identify the expected monthly benefits of adopting the optimal advertising 
policy (i.e., only advertise if in state H). The non-advertising expected monthly 
cost can be determined by solving the sequence of recursive equations: 


$$F_{m+1}(L) = 0.7F_m(L) + 0.3(F_m(H) + 20), \quad \text{where } F_0(L) = 0,$$


$$F_{m+1}(H) = 0.4F_m(L) + 0.6(F_m(H) + 20), \quad \text{where } F_0(H) = 0,$$
Table 13.4 Selected model solutions showing minimum expected costs given rate of crime and
months remaining


Variable F(S,m) | Value  | Best policy | Variable F(S,m+1) | Value  | Best policy | Difference
F(L,1)          | 6.0    | n           | F(L,2)            | 13.8   | n           | 7.8
F(L,3)          | 22.08  | n           | F(L,4)            | 30.408 | n           | 8.328
F(L,5)          | 38.741 | n           | F(L,6)            | 47.074 | n           | 8.333
F(L,7)          | 55.407 | n           | F(L,8)            | 63.740 | n           | 8.333
F(L,9)          | 72.074 | n           | F(L,10)           | 80.407 | n           | 8.333
F(H,1)          | 12.0   | n           | F(H,2)            | 21.4   | a           | 9.4
F(H,3)          | 29.84  | a           | F(H,4)            | 38.184 | a           | 8.344
F(H,5)          | 46.518 | a           | F(H,6)            | 54.852 | a           | 8.333
F(H,7)          | 63.185 | a           | F(H,8)            | 71.518 | a           | 8.333
F(H,9)          | 79.852 | a           | F(H,10)           | 88.185 | a           | 8.333


until the difference Fm+1(S) - Fm(S) equals the same constant for each value of
the crime state S.
Rounding to the nearest tenth,


F1(L) = 0.7(0) + 0.3(0 + 20) = 6.
F1(H) = 0.4(0) + 0.6(0 + 20) = 12.


F2(L) = 0.7(6) + 0.3(12 + 20) = 13.8.
F2(H) = 0.4(6) + 0.6(12 + 20) = 21.6.


F3(L) = 0.7(13.8) + 0.3(21.6 + 20) = 22.1.
F3(H) = 0.4(13.8) + 0.6(21.6 + 20) = 30.5.


F4(L) = 0.7(22.1) + 0.3(30.5 + 20) = 30.6.
F4(H) = 0.4(22.1) + 0.6(30.5 + 20) = 39.1.


F5(L) = 0.7(30.6) + 0.3(39.1 + 20) = 39.2.
F5(H) = 0.4(30.6) + 0.6(39.1 + 20) = 47.7.


Note the difference F5(L) - F4(L) = 8.6 and the difference F5(H) - F4(H) =
8.6, and thus, the expected additional monthly benefits from following the optimal
advertising policy rather than never advertising are 8.6 - 8.3 = 0.3.
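
The never-advertise recursion can also be run without intermediate rounding. The sketch below is an illustration only; the 20-month horizon and the variable names are assumptions for the example, and the optimal-policy cost is taken as 25/3, the exact fraction corresponding to the 8.33 reported earlier.

```python
# Do-nothing transition probabilities (policy n) and the cost of a high-crime month.
TP_N = {"L": {"L": 0.7, "H": 0.3}, "H": {"L": 0.4, "H": 0.6}}
HIGH_CRIME_COST = 20.0


def never_advertise_step(f):
    """F_{m+1}(S) = Pr(L|S) F_m(L) + Pr(H|S) (F_m(H) + 20) under policy n."""
    return {s: TP_N[s]["L"] * f["L"] + TP_N[s]["H"] * (f["H"] + HIGH_CRIME_COST)
            for s in ("L", "H")}


f = {"L": 0.0, "H": 0.0}          # F_0(L) = F_0(H) = 0
differences = []                   # (F_{m+1}(L) - F_m(L), F_{m+1}(H) - F_m(H))
for m in range(1, 21):
    f_new = never_advertise_step(f)
    differences.append((f_new["L"] - f["L"], f_new["H"] - f["H"]))
    f = f_new

cost_never_advertise = differences[-1][0]
cost_optimal = 25 / 3              # 8.33, the minimum monthly expected cost in the text
print(f"monthly cost, never advertise: {cost_never_advertise:.3f}")
print(f"monthly benefit of the optimal policy: "
      f"{cost_never_advertise - cost_optimal:.3f}")
```

Carried to more decimal places, the difference between the two monthly costs is somewhat smaller than the 0.3 obtained from values rounded to the nearest tenth.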

Finally, given any policy, optimal or not, one can compute the probabilities of 
being in any state. For this problem in which advertising is only implemented 
when in a high crime state, the transition probabilities from one state to another 
are shown in Fig. 13.10. 

Solving for the steady-state probabilities of L and H 


Pr(L) = Pr(L)0.7 + Pr(H)0.6 or Pr(H) = Pr(L)0.3 + Pr(H)0.4
and Pr(L) + Pr(H) = 1

Fig. 13.10 Transition probabilities if an optimal policy is followed

State in month m+1:        L      H
State in month m     L    0.7    0.3
                     H    0.6    0.4


results in 


Pr(L) = 0.667 and 
Pr(H) = 0.333, 


as previously determined using the linear model involving unknown joint proba- 
bilities. 

This illustrates that one can obtain both operating policies (k given S) and state 
probabilities (Pr(S)) solving either linear or dynamic programming models of this 
or similar stochastic optimization problems. In one case, we find the optimal joint 
probabilities of states and policies and derive the operating policies from them. 
In the other case, we find the optimal policies and derive their joint probabilities. 
Neat! (Fig. 13.11). 


Fig. 13.11 The game of squash racquets, another example of a stochastic process 


Exercises 


1. Predicting weather. 
The mayor is considering having a $100-a-plate dinner to increase the funds 
available for the homeless. His problem is that he doesn’t know how many people 
might come. Experience suggests the attendance largely depends on whether it 
rains or not. 
The probability of a dry day depends on the previous day’s condition. The local
weather service has provided the following conditional probabilities of dry and
wet days:


Day t+1:         Dry    Wet
Day t:    Dry    0.80   0.20
          Wet    0.47   0.53


Invitations must be sent out at least two weeks in advance. 

(a) What is the probability of the selected day being a dry one? 

(b) Should the guests be encouraged to bring an umbrella? For this problem, 
make up convenience ‘benefits or costs’ for each possibility: For example, if 
it is dry and they do not bring an umbrella, or if it is wet and they do bring an 
umbrella, the benefit can be 10. If it rains and they do not have an umbrella, 
the benefit is -10. If it is dry and they have one, it is 5. 

2. Gambling
You are given an opportunity to begin with an investment of $1 in a succession of
gambles where in each iteration there is a 90% chance of doubling your money
and a 10% chance of losing all the money won plus your initial $1. You can quit
playing at any time. What are your expected earnings and the probability of having
them for successive iterations, and when, and why, would you stop playing?

3. Crime Reduction
A community center provides recreation facilities for young people. Among the
benefits to the community are lower crime rates. Assume there are two states of
crime rates, low (L) and high (H). Observed crime rates over time show that
if the crime rate is low in any month, the probability of having a low rate the
following month is 0.5. The probability of having a high-rate month following a
low-rate month is 0.5. If the crime rate is high in a month, the probability of a high
rate the following month is 0.9, and thus, the probability of a low rate next month
is 0.1. These probabilities apply if the community center does not advertise. This
is the ‘do-nothing’ policy (Policy n). These conditional probabilities are shown
in Fig. 1. However, if the center advertises its recreation programs (Policy a), the
conditional probabilities change to those shown in Fig. 2.

The community center can change its policy at the beginning of each month.
The high crime month costs 20 more than the low crime month, and advertising
costs 10 per month.


Fig. 1 Policy n:      Month t+1
                       L      H
Month t        L      0.5    0.5
               H      0.1    0.9

Fig. 2 Policy a:      Month t+1
                       L      H
Month t        L      0.8    0.2
               H      0.6    0.4


176 13 Modeling Stochastic Processes 


Show how you would determine what policy to implement following each type 
of month (low or high crime rate) to minimize the total expected cost of crime 
and advertising expense. 

Hint: You can use dynamic programming along with the network below if you 
wish. Work backward. Stop when the minimum cost policies (decisions) remain 
the same in two successive months. 


[Figure: network of crime states L and H and policies n and a over successive months]


Solve for the steady-state policy that doesn’t change given the state (H or 
L) over time. You solve the problem represented by the network above, using 
dynamic programming or linear programming where the variables are the joint 
probabilities of states and decisions. 

4. You are considering a 3-day trail maintenance project in a state park. The weather
for the last 10 days has been the following:
Good, Good, Good, Bad, Bad, Good, Good, Bad, Good, Good.
(a) Compute the probability of having three consecutive days of good weather.
(b) Compute the probability of having at least one bad weather day in those three
days.
Modeling 


ABSTRACT 


Constraints of models that contain random variables may be applicable only 
some of the time. Constraints that apply only a specified fraction of the time are 
called chance constraints. This chapter illustrates how chance constraints can be 
included in optimization models. In addition, the chapter demonstrates how to 
generate values of random variables fitting user defined probability distributions. 
These random variable values often serve as inputs to stochastic simulation 
models. 


14.1 Chance Constraints 


In the previous chapters where constraints were developed for various optimiza- 
tion models, for the models to have a feasible solution, all the constraints had to 
be met all the time. Consider a situation in which forcing them to be met all the 
time in a model may be unrealistic. For example, suppose the problem involves 
allocating resources to a potential user and the supply of resources, R, available 
to allocate is random. If the potential user is planning to invest in equipment to 
be able to convert those allocated resources to benefits, the question is just how 
many resources should the user base his or her investment decision on. How many 
resources should the user plan on receiving when the user knows the actual alloca- 
tions may vary over time? If the user expects 100% reliable resource allocations, 
then such allocations would be the minimum level of resources available for allo- 
cation even though most of the time the allocations could be greater. In such cases, 
the user may be missing out on the opportunity to generate more benefits most of 
the time when more resources are available. The problem is to decide how many 
resources the user should plan on getting. This involves a tradeoff between the 
benefits generated and the reliability of those benefits. 
Fig. 14.1 Cumulative distribution of the random variable R. The value $r^{0.9}$ is a possible value of the random variable R that exceeds 90% of all values of the random variable R. (Vertical axis: $F_R(r)$, with 0.1 and 0.9 marked; horizontal axis: r, with 0, $r^{0.1}$ and $r^{0.9}$ marked.)


Consider the constraint of staying well and avoiding an infectious virus. To 
guarantee meeting that constraint may involve measures, such as complete isola- 
tion in a sterile environment, that few would want to take. Doing less than that 
involves some risk, the amount depending on what measures are not taken. Again, 
a tradeoff exists between say the degree of freedom from virus protection measures 
and the probability of getting sick. 

These are two examples where the constraints specified in a model may include 
their reliabilities. Such constraints involve random variables whose distributions 
are either known or can be calculated. Hence, in general, if a function g(X) is to be no greater than some random variable R at least P% of the time, the requirement is called a chance constraint and, with P expressed as a fraction, is written as 


$$\Pr(g(X) \le R) \ge P.$$


Models that include them are called chance constrained models. But before 
such models can be solved, these chance constraints must be converted to their 
deterministic equivalents. Referring to the sketch of the cumulative distribution in 
Fig. 14.1, one can see how this is done for the example chance constraints when 
P, expressed as a fraction, is 0.9. 

In this sketch, the horizontal axis represents possible values r of the random variable R. The vertical axis represents values of the cumulative probability distribution of the random variable R. Like all cumulative distributions, the values range from 0 to 1 and represent the probability of any specific value of r being greater than or equal to an outcome of R, $\Pr(r \ge R)$. Hence, by definition, 


$$\Pr(r^{0.1} \le R) = 0.9 \quad \text{or} \quad \Pr(r^{0.1} \ge R) = 0.1$$


since there is a 10% chance that the outcome of R will be less than $r^{0.1}$; 


$$\Pr(r^{0.9} \le R) = 0.1 \quad \text{or} \quad \Pr(r^{0.9} \ge R) = 0.9$$


since there is a 90% chance that the outcome of R will be less than $r^{0.9}$. 


Thus, to define the deterministic equivalent of $\Pr(x \ge R) \ge 0.9$, we need to ask what values of x will equal or exceed the outcome of the random variable at least 90% of the time. What value of r is a lower limit of x? If we set 


$$x \ge r^{0.9},$$


this will ensure that x will be at least as large as the outcome of R at least 90% of the time. The value $r^{0.9}$ is the lower limit of x. This expression is the deterministic equivalent of $\Pr(x \ge R) \ge 0.9$. 


In summary, 


$$\Pr(x \ge R) \ge 0.9 \;\Rightarrow\; x \ge r^{0.9}.$$


Similarly, 


$$\Pr(x \le R) \ge 0.9 \;\Rightarrow\; x \le r^{0.1}.$$


Knowing the cumulative distribution values associated with any value r, the values $r^{0.1}$ and $r^{0.9}$ can be calculated. Assume the value of the cumulative probability distribution is r/(1+r). It is 0.1 when r is $r^{0.1}$: $0.1 = r^{0.1}/(1 + r^{0.1})$, or $r^{0.1} = 1/9$. Likewise, $0.9 = r^{0.9}/(1 + r^{0.9})$, or $r^{0.9} = 9$. Thus, 


$$\Pr(x \ge R) \ge 0.9 \;\Rightarrow\; x \ge 9;$$
$$\Pr(x \le R) \ge 0.9 \;\Rightarrow\; x \le 1/9.$$


Note that multiplying both sides of the inequality in any chance constraint by −1 reverses the inequality. For example, 


$$\Pr(x \ge R) \ge 0.9 \;\Rightarrow\; -\Pr(x \ge R) \le -0.9.$$


When adding one to both sides of the constraint, it becomes 


$$1 - \Pr(x \ge R) \le 1 - 0.9 \;\Rightarrow\; \Pr(x \le R) \le 0.1.$$


Hence, 


$$\Pr(x \ge R) \ge 0.9 \;\Rightarrow\; 1 - \Pr(x \ge R) \le 1 - 0.9 \;\Rightarrow\; \Pr(x \le R) \le 0.1 \;\Rightarrow\; x \ge r^{0.9}.$$
To summarize, the deterministic equivalent of the chance constraint $\Pr(f(x) \le R) \ge P$ is $f(x) \le r$, where the value r of the random variable R is defined by the exceedance distribution function $1 - F_R(r) = \Pr(r \le R)$ when it equals P. Likewise, the deterministic equivalent of the chance constraint $\Pr(f(x) \ge R) \ge P$ is $f(x) \ge r$, where r is defined by the cumulative distribution function $F_R(r) = \Pr(r \ge R)$ when it equals P. 

Setting different values of P and finding the associated x, and hence the benefits obtained from x, provides the tradeoff between the benefits and their reliability that the user can consider when making an investment decision. 
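
The tradeoff between a planned value x and its reliability P can be tabulated once the cumulative distribution is known. The following is a minimal Python sketch, assuming the illustrative distribution r/(1+r) used in this section. The function names `quantile`, `lower_limit`, and `upper_limit` are hypothetical and chosen only for this illustration.

```python
def quantile(p):
    """Inverse of the assumed cumulative distribution F_R(r) = r / (1 + r)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must satisfy 0 <= p < 1")
    return p / (1.0 - p)


def lower_limit(P):
    """Smallest x with Pr(x >= R) >= P, that is, F_R(x) >= P."""
    return quantile(P)


def upper_limit(P):
    """Largest x with Pr(x <= R) >= P, that is, 1 - F_R(x) >= P."""
    return quantile(1.0 - P)


if __name__ == "__main__":
    # Tabulate the limits on x for several reliabilities P.
    for P in (0.5, 0.8, 0.9, 0.95, 0.99):
        print(f"P = {P:4.2f}: x <= {upper_limit(P):7.4f} "
              f"or x >= {lower_limit(P):8.4f}")
```

For P = 0.9 the two limits reproduce the values 1/9 and 9 derived above. For another distribution only `quantile` would need to change.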
Fig. 14.2 Sketch of the Monte Carlo sampling process to provide inputs to a simulation model (panel labels: Input Variables, Output Distribution)

14.2 Monte Carlo Sampling 


Consider a simulation model of a system that has random inputs as shown in 
Fig. 14.2. The system could be a hospital having patients entering and leaving, or 
a toll booth servicing arriving traffic, or a reservoir with entering flows, or people 
entering and leaving a public library, etc. To simulate such systems, we need values 
of those random inputs. If we know or assume the probability distributions of those 
random variables, Monte Carlo sampling methods are ways of obtaining values of 
these random variable inputs that fit the distributions from which they came. 

To illustrate, consider the random variable R whose cumulative distribution is 
as shown in Fig. 14.1. Its cumulative distribution, $F_R(r)$, is r/(1+r). 

Except for a few commonly used distributions, computer programs such as 
Excel are not able to generate a series of random variable values that fit some 
arbitrary probability distribution. But they are commonly able to generate a uni- 
formly distributed series of random variable values p ranging from 0 to 1. If these 
p values are values of a cumulative probability distribution of R, the corresponding 
values r of the random variable R can be computed. For example, if 


$$F_R(r) = p = r/(1 + r), \quad \text{then } r = (1 + r)p, \quad \text{or } r = p/(1 - p).$$


This is the inverse, $F_R^{-1}(p)$, of the cumulative distribution, $F_R(r)$. It is used to 
find r given p instead of finding p given r. The values of r associated with the 
uniformly distributed p values will fit the cumulative probability distribution of R. 
They will not exceed the limits of the distribution and will have approximately the 
same mean and variance and median as the original distribution given a sufficient 
number of samples. 
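
The inverse-transform step can be checked numerically. The sketch below is a Python illustration (the text itself uses Excel), assuming the distribution r/(1+r). The helper names `sample_r` and `empirical_cdf` are hypothetical. It compares the fraction of sampled values at or below a few values of r with the cumulative probability r/(1+r).

```python
import random


def sample_r(n, seed=None):
    """Draw n values of R by inverse-transform sampling, r = p / (1 - p)."""
    rng = random.Random(seed)
    # rng.random() returns p in [0, 1), so 1 - p is never zero.
    return [p / (1.0 - p) for p in (rng.random() for _ in range(n))]


def empirical_cdf(samples, r):
    """Fraction of samples that are less than or equal to r."""
    return sum(1 for s in samples if s <= r) / len(samples)


if __name__ == "__main__":
    values = sample_r(100_000, seed=1)
    for r in (1.0 / 9.0, 1.0, 9.0):
        print(f"r = {r:6.3f}  sampled {empirical_cdf(values, r):.3f}  "
              f"exact {r / (1.0 + r):.3f}")
```

The sampled and exact fractions should agree more closely as the number of samples grows.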

Assume it is of interest to find the probability that a random variable value x exceeds a particular threshold. Suppose the cumulative probability distribution of X is $F_X(x) = (x - 5)/10$, where the values of the random variable X range uniformly from 5 to 15. 

Fig. 14.3 Cumulative probability distribution of a random variable X having a uniform distribution $f_X(x) = 0.1$ from x = 5 to 15 


| | Example 1 | Example 2 | Example 3 |
|---|---|---|---|
| Density $f_X(x)$ | $1/a$ | $2/a - 2x/a^2$ | $1/(x+1)^2$ |
| Range of x | 0 to a | 0 to a | 0 and above |
| Cumulative $F_X(x) = p$ | $x/a$ | $2x/a - x^2/a^2$ | $x/(1+x)$ |
| Inverse $F_X^{-1}(p)$ | $x = ap$ | $x = a(1 - (1-p)^{0.5})$. Note: $p = 1 - (1 - x/a)^2$ | $x = p/(1-p)$ |

Fig. 14.4 Examples of finding the inverses of cumulative distributions so that values of the random variable x drawn from their probability distributions can be determined from uniformly distributed random numbers p ranging from 0 to 1 


To generate random values of x that fit the uniform distribution whose cumulative distribution is shown in Fig. 14.3, we can first generate a set of random uniformly distributed values of p, representing values of $F_X(x)$, and corresponding x values, x = 5 + 10p. Then we can include in the simulation a counter, keeping track of the number of x values exceeding a given threshold, say 14. Clearly, the values of x generated that exceed 14 should be about 10% of the samples generated. In one such simulation of 100 samples, the percent was 11. More samples might lead to a more precise estimate. This is easily accomplished using Excel. 
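
The counting experiment can be sketched in a few lines. This is a Python illustration rather than the Excel approach mentioned in the text. The function name `fraction_exceeding` is hypothetical, and the sample sizes in the demonstration are arbitrary choices.

```python
import random


def fraction_exceeding(threshold, n, seed=None):
    """Estimate Pr(X > threshold) for X uniform on [5, 15] using x = 5 + 10p."""
    rng = random.Random(seed)
    count = 0  # counter of sampled x values above the threshold
    for _ in range(n):
        p = rng.random()      # uniformly distributed value of F_X(x)
        x = 5.0 + 10.0 * p    # inverse of F_X(x) = (x - 5) / 10
        if x > threshold:
            count += 1
    return count / n


if __name__ == "__main__":
    for n in (100, 10_000, 1_000_000):
        print(n, fraction_exceeding(14.0, n, seed=0))
```

With a threshold of 14 the estimated fraction should settle near 0.10 as the number of samples increases.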

Figure 14.4 illustrates various density and cumulative distributions and their 
inverses needed to draw samples from those distributions. 

The function RAND() in Excel can be used to generate the uniformly distributed (equally likely) random values of p. Knowing any p value, the inverse function $F_X^{-1}(p)$ can be used to find the corresponding value of x. 
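
As an illustration, the three inverses shown in Fig. 14.4 can be written as small functions and checked by substituting each result back into its cumulative distribution. This is a Python sketch with hypothetical function names, and the upper limit a = 10 in the demonstration is an assumed value.

```python
import math


def inv_uniform(p, a):
    """Inverse of F(x) = x / a on 0 <= x <= a."""
    return a * p


def inv_decreasing_triangular(p, a):
    """Inverse of F(x) = 2x/a - x^2/a^2 = 1 - (1 - x/a)^2 on 0 <= x <= a."""
    return a * (1.0 - math.sqrt(1.0 - p))


def inv_ratio(p):
    """Inverse of F(x) = x / (1 + x) on x >= 0 (requires p < 1)."""
    return p / (1.0 - p)


if __name__ == "__main__":
    a = 10.0  # assumed upper limit, for illustration only
    for p in (0.1, 0.5, 0.9):
        print(p, inv_uniform(p, a), inv_decreasing_triangular(p, a),
              inv_ratio(p))
```

Feeding uniformly distributed values of p into any one of these functions yields values of x that follow the corresponding density.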


14.3 Another Example 


Consider a symmetric triangular probability density function that ranges from 0 to 10 whose mean and most likely value is 5. This density and its cumulative distribution function are sketched in Fig. 14.5. 
Fig. 14.5 Probability and cumulative distribution of a triangular distributed random variable. Density: $0.04x$ for $0 \le x \le 5$, and $0.2 - 0.04(x - 5) = 0.4 - 0.04x$ for $5 \le x \le 10$. Cumulative probability $\Pr(X \le x)$: $0.02x^2$ for $0 \le x \le 5$, and $-1 + 0.4x - 0.02x^2$ for $5 \le x \le 10$. 


Note that $-1 + 0.4x - 0.02x^2 = 1 - 0.02(10 - x)^2$. Therefore, $x = 10 - ((1 - p)/0.02)^{0.5}$ for $5 \le x \le 10$, that is, for $0.5 \le p \le 1$. 

Using the inverse of this cumulative distribution function, any value of the cumulative distribution function, p, can be converted to a value of the random variable, x, having a symmetric triangular distribution. 


$$x(t) = \begin{cases} (p(t)/0.02)^{0.5}, & \text{if } p(t) < 0.5, \\ 10 - ((1 - p(t))/0.02)^{0.5}, & \text{otherwise.} \end{cases}$$


Using the above equations, sets of random uniformly distributed values of p representing cumulative distribution values ranging from 0 to 1 were generated along with their corresponding x values that have this triangular probability distribution. 

Of interest here is how the mean and variance of all the x(t) values compare to 
the true mean and variance of the triangular distribution. Given a sample size of 
n, 


$$\text{mean} = \Big( \sum_{t=1}^{n} x(t) \Big) / n;$$


$$\text{variance} = \Big( \sum_{t=1}^{n} (x(t) - \text{mean})^2 \Big) / n.$$
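
A sampling experiment of this kind can be sketched as follows. This is a Python illustration with hypothetical function names, not the spreadsheet used to produce the book's results. It draws x(t) from the symmetric triangular distribution on 0 to 10 and computes the mean and variance using the two expressions above. Because the random numbers differ, its output will not match any particular published run.

```python
import math
import random


def triangular_inverse(p):
    """Inverse CDF of the symmetric triangular distribution on [0, 10]."""
    if p < 0.5:
        return math.sqrt(p / 0.02)
    return 10.0 - math.sqrt((1.0 - p) / 0.02)


def sample_mean_variance(n, seed=None):
    """Return the mean and (divide-by-n) variance of n sampled x(t) values."""
    rng = random.Random(seed)
    xs = [triangular_inverse(rng.random()) for _ in range(n)]
    mean = sum(xs) / n
    variance = sum((x - mean) ** 2 for x in xs) / n
    return mean, variance


if __name__ == "__main__":
    for n in (10, 100, 1000, 10_000):
        mean, variance = sample_mean_variance(n, seed=n)
        print(f"n = {n:6d}  mean = {mean:.3f}  variance = {variance:.3f}")
```

The true values for this distribution are a mean of 5 and a variance of 75/18, about 4.167, so the estimates should approach these as n increases.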


The comparisons are shown in Table 14.1. One way to generate uniformly distributed random numbers from 0 to 1 is shown in Fig. 14.6. Just subtract 2 from their sum and divide by 10. 

Table 14.1 Results of Monte Carlo simulations of various sizes 

| Sample size n | Mean | Variance |
|---|---|---|
| 10 | 4.366 | 1.916 |
| 100 | 5.016 | 3.318 |
| 1000 | 5.091 | 3.991 |
| 9000 | 5.011 | 4.119 |
| 9999 | 5.016 | 4.132 |
| True (derived from calculus) | 5.000 | 4.167 |

Fig. 14.6 One means of Monte Carlo sampling. Using a computer (e.g., RAND() in Excel) is faster. 


Exercises 


1. Consider an ‘allocation problem’, but with chance constraints on meeting random demands $D_j$ at demand sites j. For example, if you wanted your allocation $A_j$ to user j to meet or exceed the user’s demand $D_j$ at least 95% of the time, the chance constraint is 


$$\Pr\{A_j \ge D_j\} \ge 0.95.$$

The deterministic equivalent is 

$$A_j \ge d_j^{0.95},$$

where $d_j^{0.95}$ is the demand that is exceeded only 5% of the time. 


Assume the cumulative distribution of demand d is d/(1+d). This is the probability that the actual random demand will be less than d. When d is 0, the cumulative probability is 0. There is no probability that the actual demand will be less than 0. As d increases, the probability that the random actual demand will be equal to or less than d approaches 1. Therefore, $d_j^{0.95}$, the demand that will be exceeded only 5% of the time, can be computed. The actual allocation, $A_j$, must be at least this amount to satisfy the demand at least 95% of the time. 

The demand $d_j^{0.95}$ that will be at least equal to the actual demand 95% of the time is determined by setting the cumulative distribution to 0.95. 


0.95 = d/(1+d); d = 0.95 + 0.95d; thus, d = 0.95/0.05 = 19. 


Hence, the deterministic equivalent of the chance constraint is $A_j \ge 19$. 
(a) Define the deterministic constraints for the following: 
(i) $\Pr\{A_j \ge D_j\} \ge 0.8$. 
(ii) $\Pr\{A_j \le D_j\} \le 0.10$. 
(iii) $\Pr\{A_j \ge D_j\} \le 0.50$. 
(b) Generate a series of random uniformly distributed probabilities and their 
corresponding values of demand d. The proportion of d values less than or 
equal to 19 is a way to see if the minimum allowable allocation of 19 will 
satisfy the random demand at least 95% of the time. Now you can also check 
on your answer to (i) and (ii) above as well. 


2. Consider an allocation problem where the supply of resources available for various 
users in each time period is uncertain. Assume the supply’s probability distribu- 
tion in each time period is uniform between 5 and 15. Users want to know the 
tradeoff between what they can count on and its reliability. If your objective when 
allocating the available resources is to minimize the maximum percentage deficit 
between what each user wants and what they get, or equivalently their maximum 
level of satisfaction, show the model you would use to generate the information 
they desire. 


3. Monte Carlo sampling. 
(a) Show how you would generate equally likely values of the random variable 
X that has the following probability distribution: 
Show how to compute the mean or expected or average value, and the variance, 
of n discrete x(t) values randomly generated from this probability distribution. 


4. Consider a random variable X that has the following discrete probability 
distribution, ranging from 2 to 5. 


(a) Describe how to generate multiple discrete values, x(i), of the random variable 
X that fits this distribution. 

(b) Write the equations for calculating the mean and variance of all the n values 
you obtained. 


5. You are having to decide how many trucks you need to purchase and drivers you 
need to hire to pick up trash each day. Between 10 and 30 truck-day units of trash 
are produced each day, and these amounts are uniformly distributed. All the trash 
must be picked up each day. Each truck can haul enough to bring in $600 per 
day. However, for each day a truck and driver are idle because there is not enough 
trash to pick up, the loss is $800 per truck. If private contractors must be hired to 
pick up any excess trash, the cost is $200 per truck per day. 


Example: If 20 trucks are available (the target) and only 18 are needed, the 
net income is 20(600) — 2(800). If 22 trucks are required, the net income is 
20(600) — 2(200). 


(a) Describe how to determine the most economical target number of trucks to buy 
using the Monte Carlo sampling. 

(b) Develop and solve an optimization model for finding the number of trucks to 
buy that maximizes expected net income. 

(c) If you wanted to be sure that your target number of trucks would be able to 
pick up all the trash produced at least 90% of the time, what would be the target 
number? 
ABSTRACT 


An introduction to deterministic and stochastic system simulation modeling and 
various statistical measures of their outputs. 


15.1 Introduction 


Simulation methods address ‘what if?’ questions. Given a set of assumed values 
assigned to all decision variables and parameters in a model of some system, and 
given the set of assumed external inputs, a simulation of the system produces 
model outputs that can be used to compare with other simulations based on other 
sets of assumed values in the search for the ‘best’ system decision variable values. 
Models used for simulating a system can be as detailed in their representation of a 
system as desired, as there are no restrictions imposed by the method of solution, 
as is the case for all the optimization methods presented in the previous chapters. 
Thus, simulation model outputs can be more realistic indications of system perfor- 
mance, again given the assumptions made when developing any simulation model 
and setting the values of its parameters and variables. 

Simulation methods can be applied to natural, engineered, or social systems to 
gain insight into their functioning or performance. For example, simulation models 
are used to predict the impacts of traffic congestion, or the spread of a contagious 
disease, or to estimate the likely damage resulting from flooding events in a community. Computer-based simulations of systems are useful and much less expensive 
and quicker to perform than designing and building and operating alternative real 
systems and waiting to find out how well they performed over time. 

In situations in which the number of alternatives that warrant such simulations is large, and evaluating the output of each simulation takes considerable time, some sort of preliminary screening of alternatives may be useful. In many cases optimization modeling can serve as a means of preliminary screening. Optimization can be performed not necessarily to find the best values of decision variables but to eliminate from further consideration the clearly inferior ones. 


© The Author(s) 2022 
D. P. Loucks, Public Systems Modeling, International Series in Operations Research & Management Science 318, https://doi.org/10.1007/978-3-030-93986-1_15 


Fig. 15.1 Flight training simulators that include humans in the simulation. https://en.wikipedia.org/wiki/Flight_simulator#/media/File:9803 10-N-7355H-03_Simulator_Training.jpg File: SSJ100 FFS 1 (9318513805).jpg. https://en.wikipedia.org/wiki/Flight_simulator Public Domain and CC BY-SA 2.0 

Interactive simulation methods, sometimes referred to as human-in-the-loop 
simulations, are simulations that include humans making decisions as the simu- 
lation proceeds and responds to those decisions. Humans are making decisions 
based on the state of the system and external factors while the simulations are 
taking place. Examples include flight, rail, ship handling, or bus driving simu- 
lators. Such simulations, as illustrated in Fig. 15.1, are often used for training 
system operators, but they can also be used to learn more about how a system 
should be designed and/or managed or operated and about human behavior or 
decision-making under various system states. 

Computer simulation has become a useful way to study many systems in 
physics, chemistry, biology, engineering, agriculture, business, economics, regional 
planning, and sociology among other application areas. Humans are often part of 
all such systems even though not always included in the simulation models. 


15.2 Stochastic Simulations 


As discussed in Chap. 14, Monte Carlo sampling provides a means of generat- 
ing random values from given probability distributions. The name comes from its 
resemblance to what takes place in a real gambling casino. Monte Carlo methods 
are often useful when random inputs and outputs apply in any system simulation. 
Many systems have random inputs. Hospitals, police and fire departments, shelters 
for the homeless, libraries, schools, food pantries, and public parks are among the 
many examples of public systems having random inputs. Simulating such systems 
can benefit from the use of Monte Carlo methods to provide random inputs that 
come from realistic probability distributions. 

Associated with a set of inputs, a simulation model will produce a correspond- 
ing set of outputs. Each alternative system simulated many times will have its 
own output distribution. Statistical measures of these output distributions provide 
a basis for comparing alternative system performances. 

Two example simulation models follow. 


15.3 Water Quality Simulation 


Consider a small fully mixed lake (Fig. 15.2) having a constant volume V. Its 
inflow Q contains a pollutant W. By simulating the lake’s quality, one can estimate 
what the pollutant concentration, C, of the lake will be over time. As we develop 
this simulation model, we will start with a simple one and add more realism later. 

To begin, assume the inflows Q and pollutant loadings W are constant over 
time. Thus everything is constant except the pollutant concentration in the lake 
until it too reaches an equilibrium and does not change over time. Also assume, 
since the volume of the lake is constant, the inflow equals the outflow (and there 
is no significant evaporation or seepage). 

Defining the variables and parameters needed to model this lake, we will be 
dealing with units of mass (M), length (L), and time (T) (Table 15.1). 

The mass of pollutant input per unit time period, W, is its flow discharge times 
its concentration. Its flow discharge is included in the total inflow to the lake, Q. 

The pollutant decay constant k is the mass of pollutant loss per unit of mass 
available per time period (i.e., a day) (M/M/T or 1/T). Its value depends on the 
type of pollutant as well as the water temperature. 

To create a mass balance equation for the pollutant in the lake, we can equate the change in mass of pollutant in the lake to the mass that comes into the lake minus the mass that is contained in the lake outflow and the amount that decays while in the lake. Each term in this mass balance equation will have units of mass/time or M/T. 


Fig. 15.2 A constant volume lake receiving wastewater containing a pollutant (labels: Inflow Q; Waste W; Outflow Q; Concentration C) 


Table 15.1 Notation used to develop a simulation model that will predict the changing quality of the lake 

| Descriptor | Variable or parameter | Units | Example |
|---|---|---|---|
| Water volume | V | L³ | Cubic meters |
| Inflow, outflow | Q | L³/T | Cubic meters per second |
| Pollutant mass | M | M | Kilograms |
| Pollutant input | W | M/T | Kilograms per day |
| Pollutant concentration | C | M/L³ | Milligrams per liter |
| Pollutant decay constant | k | M/M/T = 1/T | 1/day |

Denoting the change of pollutant concentration, C, in the lake over time t, as dC/dt (M/L³/T), 


$$V \, dC/dt = W - QC - kVC.$$


Thus the change in mass of pollutant in the lake, V dC/dt (L³·M/L³/T = M/T), equals 


the mass that comes into the lake, W (M/T), 
less the mass that is discharged from the lake, QC ((L³/T)(M/L³) = M/T), 
less the mass that decays in the lake, kVC ((M/M/T)(L³)(M/L³) = M/T). 


Since volumes, flows, pollutant inflow concentrations and decay rates are constant over time, eventually the lake pollutant concentration will become constant. It will not change over time. The term dC/dt in the above mass balance equation will be 0. Solving this equation for C when dC/dt is 0 will define its equilibrium concentration value, $C_{eq}$. 


$$C_{eq} = W/(kV + Q).$$


Using discrete simulation for this deterministic system, we can see what happens to the lake’s pollutant concentration on its way toward an equilibrium. In other words, we can generate a time series of predicted lake concentrations C(t) at the beginning of each time period t given an initial concentration, C(1), at t = 1. 

Let dC be approximated by (C(t + 1) − C(t)) and dt by Δt. Then the mass balance equation can be written as 


$$V \, (C(t+1) - C(t)) = \left[ W - Q \, (C(t+1) + C(t))/2 - kV \, (C(t+1) + C(t))/2 \right] \Delta t.$$
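
Because C(t + 1) appears on both sides of this mass balance equation, it can help to collect terms before simulating. A short rearrangement, using only the symbols already defined, gives an explicit update for the next concentration:

$$\left[ V + (Q + kV)\frac{\Delta t}{2} \right] C(t+1) = W \, \Delta t + \left[ V - (Q + kV)\frac{\Delta t}{2} \right] C(t),$$

$$C(t+1) = \frac{W \, \Delta t + \left[ V - (Q + kV) \, \Delta t / 2 \right] C(t)}{V + (Q + kV) \, \Delta t / 2}.$$

Setting C(t + 1) = C(t) in this update gives C = W/(kV + Q), the same equilibrium found by setting dC/dt to 0.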


Table 15.2 Successive lake pollutant concentrations C(t) for two different values of the pollutant decay constant k 

| Time period | k = 0 | k = 0.1 |
|---|---|---|
| 1 | 5.0 | 5.0 |
| 2 | 13.08 | 11.21 |
| 3 | 17.43 | 13.56 |
| 4 | 19.77 | 14.45 |
| 5 | 21.03 | 14.79 |
| 6 | 21.71 | 14.92 |
| 7 | 22.07 | 14.97 |
| 8 | 22.27 | 14.99 |
| 9 | 22.37 | 15.0 |
| 10 | 22.43 | 15.0 |
| 11 | 22.46 | 15.0 |
| 12 | 22.48 | 15.0 |
| 13 | 22.49 | 15.0 |
| 14 | 22.49 | 15.0 |
| 15 | 22.50 | 15.0 |

The values in period 1 are assumed. The values in period 15 equal the equilibrium concentrations. 


This equation assumes that the units of all the parameters and variables are 
consistent, and the outgoing concentration in each period t is the average of the 
initial and final concentrations in the lake in that period. 

To simulate a numerical example, assume W = 450; Q = 20; k = 0 and 0.1; V = 100; Δt = 3; and an initial pollutant concentration, C(1), of 5. The model’s successive solutions are listed in Table 15.2 for 15 3-day time periods. 
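
The numerical example can be reproduced with a short program. The following is a minimal Python sketch, assuming the averaged mass balance equation above is solved for C(t + 1) in each period. The function name `simulate_lake` and its argument names are hypothetical.

```python
def simulate_lake(W, Q, k, V, dt, c_initial, periods):
    """Return [C(1), ..., C(periods)] from the averaged mass balance equation."""
    half = (Q + k * V) * dt / 2.0   # outflow plus decay acting on the average C
    series = [c_initial]
    for _ in range(periods - 1):
        c_now = series[-1]
        # Mass balance solved for C(t + 1).
        c_next = (W * dt + (V - half) * c_now) / (V + half)
        series.append(c_next)
    return series


if __name__ == "__main__":
    for k in (0.0, 0.1):
        series = simulate_lake(W=450.0, Q=20.0, k=k, V=100.0, dt=3.0,
                               c_initial=5.0, periods=15)
        equilibrium = 450.0 / (k * 100.0 + 20.0)
        print(f"k = {k}: equilibrium {equilibrium:.2f}")
        print("  " + ", ".join(f"{c:.2f}" for c in series))
```

Changing `dt` or `c_initial` shows how the step size and starting concentration affect the path toward the equilibrium without changing the equilibrium itself.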


15.4 Lake Quality Simulation with Random Wasteloads 


Consider the same lake having a constant volume, inflow and outflow, and pollutant 
decay rate, but with a random pollutant loading. The concentrations of pollutants 
entering the lake are described by a probability distribution. For this example, 
assume this probability distribution of pollutant inputs, W(t), is uniform, ranging 
from 200 to 700 with a mean of 450. Let At = 1. We can now generate a time 
series of W(t) and C(t) and based on that time series, compute the mean and 
standard deviation of the waste loads W(t) and lake pollutant concentrations C(t). 

For purposes of comparison, we can assume the same deterministic values for inflow, lake volume, and a decay constant of k = 0. We can define the cumulative distribution of pollutant mass inputs W(t) per unit time and use it to convert a generated series of uniformly distributed random variable values, p, ranging from 0 to 1, to corresponding random values of W(t) distributed uniformly from 200 to 700 (Fig. 15.3). 
Fig. 15.3 Wasteload probability distribution and its cumulative distribution 


The cumulative probability, p(t), of a pollutant loading of W(t) at time t is (W(t) − 200)/500 for W(t) between 200 and 700, hence 


$$W(t) = 200 + 500 \, p(t),$$


and 


$$V \, (C(t+1) - C(t)) = \left[ W(t) - Q \, (C(t+1) + C(t))/2 - kV \, (C(t+1) + C(t))/2 \right] \Delta t.$$


Starting with an initial lake concentration of C(1) = 0, one simulation of 100 
daily time steps (At = 1) resulted in a mean pollutant mass input of 437 (compared 
to a true mean of 450), with a standard deviation of 130. 

In this simulation the lake pollutant concentration, C(t), reached a value exceed- 
ing the equilibrium concentration of 22.5 in less than 20 days. The mean of the 
remaining concentrations was 19.6 with a standard deviation of 4.2. 
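
A stochastic run of this kind can be sketched as follows. This is a Python illustration with hypothetical names, assuming the same Q = 20, V = 100, and k = 0 as in the deterministic example. Since the random numbers differ, it will not reproduce the particular statistics reported for the run described here.

```python
import random
import statistics


def simulate_random_loads(days, Q=20.0, k=0.0, V=100.0, dt=1.0,
                          c_initial=0.0, seed=None):
    """Simulate daily random wasteloads W(t) and lake concentrations C(t)."""
    rng = random.Random(seed)
    half = (Q + k * V) * dt / 2.0
    loads, concentrations = [], [c_initial]
    for _ in range(days):
        w = 200.0 + 500.0 * rng.random()   # W(t) = 200 + 500 p(t)
        loads.append(w)
        c_now = concentrations[-1]
        # Averaged mass balance solved for C(t + 1).
        concentrations.append((w * dt + (V - half) * c_now) / (V + half))
    return loads, concentrations


if __name__ == "__main__":
    loads, conc = simulate_random_loads(100, seed=1)
    print(f"mean W = {statistics.mean(loads):.1f}, "
          f"std W = {statistics.pstdev(loads):.1f}")
    later = conc[20:]  # concentrations after the initial build-up
    print(f"mean C = {statistics.mean(later):.2f}, "
          f"std C = {statistics.pstdev(later):.2f}")
```

Repeating the run with different seeds, or with more days, gives a sense of how much the summary statistics of a single 100-day simulation can vary.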

Some of the concentrations at the beginning and end of this particular simulation run are listed in Table 15.3. 

Table 15.3 Sample simulation and summary statistics of lake pollutant concentrations 

| Day | Initial concentration |
|---|---|
| 1 | 0.00 |
| 2 | 3.78 |
| 3 | 6.51 |
| 4 | 8.16 |
| 5 | 11.96 |
| 6 | 12.09 |
| ... | ... |
| 20 | 23.39 |
| 21 | 22.62 |
| ... | ... |
| 97 | 23.65 |
| 98 | 24.40 |
| 99 | 22.56 |


15.5 Possible Chaos 


This next purely mathematical example shows how the values of assumed parameters in a discrete simulation model, along with the duration of the simulation time step, may alter the path toward an equilibrium, even to the point where the simulation may not reach an equilibrium even though one exists. The model is defined by the simple differential equation 


$$dx/dt = (a - 1)x - ax^2;$$


the rate of change in x depends on the value of x and a parameter ‘a’. We can find the equilibrium solution by setting dx/dt equal to 0. 


$$0 = (a - 1)x - ax^2.$$

Dividing by x (for x not equal to 0) gives $0 = (a - 1) - ax$, so $x = (a - 1)/a$; thus an equilibrium exists when $x = (a - 1)/a$ for any non-zero value of a. 


Clearly, as the value of the parameter ‘a’ increases, the equilibrium value of x 
approaches, but never reaches, 1. 

A question is how successive values of x will tend toward their equilibrium values, and whether an equilibrium will ever be reached if the system is not already in an equilibrium. In other words, are the equilibria stable? 

Consider a discrete simulation where dx/dt is approximated by 


$$\Delta x/\Delta t = (x(t+1) - x(t))/\Delta t = (a - 1)x(t) - a \, x(t)^2,$$


which can be written as $x(t+1) = x(t) + ((a - 1)x(t) - a \, x(t)^2)\Delta t$. 

The plots in Fig. 15.4 show successions of x values given six different non-negative values of the parameter ‘a’ and a simulation step size Δt starting at x(1) = 0.2. The smaller the step size Δt the larger the value of ‘a’ for which the equilibrium is stable. With a step size of 0.5, if ‘a’ is 6 the sequence of x values corresponds to the graph showing ‘a’ of 3.5 with step size of 1. 

This example illustrates how simulations of deterministic non-linear systems 
can be sensitive to initial conditions and simulation step sizes, and in some cases 
even show apparent random behavior. 
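
The behavior described here can be explored with a few lines of code. The following Python sketch iterates the discrete equation from x(1) = 0.2. The function name `simulate_x` is hypothetical, and the particular values of ‘a’ and Δt in the demonstration are assumptions chosen for illustration, not necessarily those plotted in Fig. 15.4.

```python
def simulate_x(a, dt, x_initial=0.2, steps=200):
    """Iterate x(t+1) = x(t) + ((a - 1) x(t) - a x(t)^2) dt."""
    xs = [x_initial]
    for _ in range(steps):
        x = xs[-1]
        xs.append(x + ((a - 1.0) * x - a * x * x) * dt)
    return xs


if __name__ == "__main__":
    # Assumed (a, dt) pairs, for illustration only.
    for a, dt in ((2.0, 1.0), (3.2, 1.0), (3.5, 1.0), (3.9, 1.0), (3.5, 0.5)):
        xs = simulate_x(a, dt)
        tail = ", ".join(f"{x:.4f}" for x in xs[-4:])
        print(f"a = {a}, dt = {dt}: equilibrium {(a - 1.0) / a:.4f}, "
              f"last values {tail}")
```

Comparing the last few values with the equilibrium (a − 1)/a shows which combinations settle down and which keep oscillating. Halving the step size for the same ‘a’ can turn an oscillating sequence into a converging one.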


Fig. 15.4 Plots showing the impact on x(t) values over time given some different values of ‘a’ and 


15.6 Endowment Giving 


Many organizations, including those shown in Fig. 15.5, count on income from their endowment to cover some of their capital and operating costs. There is a strategy in raising an endowment. If an endowment campaign looks like it will be successful to potential donors, they are more likely to contribute to the endowment than if they think it will be unsuccessful. One measure of potential success is the amount of money already given. This is why some major donations are often sought before the ‘publicly announced’ campaign begins. Yet there may also be some who are reluctant to give to an organization if the total amount already raised is already very large, especially donors wanting to maximize the marginal values of their donations. Giving a specific amount of money to a well-endowed organization will likely result in a smaller marginal value than that derived by an organization having a smaller endowment but similar expenses. 


Fig. 15.5 Just a few of the most highly endowed universities in the US (logos shown: Penn, Columbia, Harvard, Dartmouth, Yale, Brown, Princeton) 


Fig. 15.6 A function for estimating the amount of giving over time, each amount being dependent on the amount, E(t), already given shown on the horizontal axis (vertical axis: E(t+1); curve: $a E(t)^{0.7} - 100$; marked points: stable equilibrium, unstable equilibrium; annotation at −100: need at least this to build up an endowment) 

These notions are captured in the following example illustrated in Fig. 15.6. The 
variable E(t) is the level of giving already raised in the campaign by the beginning 
of period t. 

The above plot shows a function used to predict the money raised over time, each amount raised being dependent on the total endowment already raised, E(t), by the beginning of period t. The change in the total endowment in period t is E(t + 1) − E(t), and E(t + 1) is 


$$E(t+1) = a \, E(t)^{0.7} - 100.$$


At equilibrium E(t + 1) = E(t). 

Hence when $E = a \, E^{0.7} - 100$ the system is in equilibrium. 

If ‘a’ is 50, E = 2.800119 or 460,170.5. 

The lower equilibrium is unstable. If E(t) is less than 2.8, the following E(t + 
1) will be even smaller, which in fact will not happen, but it indicates a decreas- 
ing interest in donor giving, at least until E(t) reaches 2.8. Perhaps this shows 
why many fund-raising campaigns are not announced until the organizers have 
already raised a substantial amount. If E(t) is greater than 2.8 in this example, 
then the following values of E(t + 1) will be even more until its value equals 
its upper equilibrium value. Beyond that upper equilibrium value donors are less 
likely to give more, perhaps feeling the organization’s endowment campaign has 
raised enough money. The fact that mathematically changes in the endowment are negative below the lower equilibrium value of 2.8 and above the upper equilibrium value of 460,171 simply shows that the valid range of this function is all values of E(t) between these two equilibrium values. 
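
The two equilibrium values and the sequence of giving can be checked numerically. The sketch below is a Python illustration, assuming a = 50 and the exponent 0.7. The function names and the bisection brackets are hypothetical choices made for this example.

```python
def next_endowment(E, a=50.0):
    """E(t+1) = a E(t)^0.7 - 100, valid for E > 0."""
    return a * E ** 0.7 - 100.0


def find_equilibrium(low, high, a=50.0, tol=1e-9):
    """Bisection on g(E) = a E^0.7 - 100 - E over a bracket [low, high]."""
    def g(E):
        return next_endowment(E, a) - E

    if g(low) * g(high) > 0:
        raise ValueError("bracket does not contain a sign change")
    while high - low > tol * max(1.0, high):
        mid = 0.5 * (low + high)
        if g(low) * g(mid) <= 0:
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)


def simulate_giving(E_start, periods, a=50.0):
    """Return successive E(t); stop early if E would drop to zero or below."""
    series = [E_start]
    for _ in range(periods):
        E = next_endowment(series[-1], a)
        if E <= 0.0:
            break
        series.append(E)
    return series


if __name__ == "__main__":
    print("lower equilibrium:", round(find_equilibrium(2.0, 10.0), 6))
    print("upper equilibrium:", round(find_equilibrium(1.0e5, 1.0e6), 1))
    print("from E(1) = 3.0:", [round(E, 1) for E in simulate_giving(3.0, 10)])
    print("from E(1) = 2.5:", [round(E, 1) for E in simulate_giving(2.5, 10)])
```

Starting just above the lower equilibrium, the sequence climbs toward the upper one. Starting just below it, the function immediately returns a non-positive value, which is outside its valid range.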


• In addition to predicting the sequence of endowment giving that will occur over time, the total time, n, needed to reach a given total amount of money, T, can be estimated. The total amount of additional endowment at the end of n periods, assuming the endowment is invested at a compound interest rate of i per period in following periods, is 


$$E(t+1) = a \, [E(t)(1 + i)]^{0.7} - 100, \quad t = 1, 2, \ldots, n, \quad \text{where } E(n+1) \ge T.$$


15.7 Forest Management 


In a particular town watershed there exist two competing tree species: hardwoods 
and softwoods. The watershed is managed primarily to produce clean water, but 
it also serves as wildlife habitat and source of income from timber. Cutting trees 
in a sustainably managed way can increase water yields, habitat value, and timber 
income (Fig. 15.7). 

First consider an unmanaged forest. In an unmanaged forest, hardwood and 
softwood trees compete for the available sunlight, soil nutrients, and water. Hard- 
wood trees grow more slowly but are more durable and produce more valuable 
timber. Softwood trees compete with the hardwoods by growing more rapidly and 
by consuming water and soil nutrients in the process. Can these two types of trees 
coexist indefinitely, or will one type of tree drive the other type to extinction? 

One measure of the amount of forest growth in the watershed is the basal area 
of trees (Fig. 15.8). This is the cross sectional area of the trunk near the base of 
the tree. For both hardwood and softwood species the increase in basal area per 
hectare per year is directly proportional to the initial basal area of that species. 
However, this potential increase in basal area is reduced by the loss in basal area 
due to competition from its own species and from the other species. 

Let 


H(y)  basal area of hardwoods per hectare at the beginning of year y.

S(y)  basal area of softwoods per hectare at the beginning of year y.

r_t   basal area growth per unit basal area per hectare for species type t (t = H or S).

a_t   basal area loss per unit of basal area of species type t per unit basal area
      of the same species per hectare.

b_t   basal area loss per unit of basal area of species type t per unit basal area
      of the different species per hectare.


Fig. 15.7 Unmanaged and managed hardwood and softwood forests. https://en.wikipedia.org/wiki/Forestry. By Queryzo, own work, CC BY-SA 3.0; by Snežana Trifunović, own work, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=2647911, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=1975900

Fig. 15.8 Defining the basal area of a tree


Equations that describe the changes in basal area over time for both types of tree 
species can be written as 


dH/dy = r_H H(y) − a_H H(y)^2 − b_H H(y) S(y),
dS/dy = r_S S(y) − a_S S(y)^2 − b_S H(y) S(y).


These two differential equations can be expressed as difference equations that
define the basal areas at the end of each year y, H(y + 1) and S(y + 1), in terms
of H(y) and S(y). Assume dH/dy = ΔH/Δy. Similarly, replace dS/dy with ΔS/Δy.


ΔH = H(y + 1) − H(y), and
ΔS = S(y + 1) − S(y).


Substituting these expressions into the differential equations above results in 
the mass balance equations: 


H(y + 1) = H(y) + [r_H H(y) − a_H H(y)^2 − b_H H(y) S(y)] Δy,
S(y + 1) = S(y) + [r_S S(y) − a_S S(y)^2 − b_S H(y) S(y)] Δy.


These can be solved in succession starting with some initial conditions for H(1) 
and S(1). 
There are four equilibrium solutions for these difference equations. Clearly one
is when no trees exist, H = S = 0. Two others are when one or the other species
does not exist.


If H = 0, then from the softwood difference equation, S = r_S/a_S.


If S = 0, then from the hardwood difference equation, H = r_H/a_H.


If both H and S are positive, then from both difference equations, the 
equilibrium values are 


H = (a_S r_H − b_H r_S)/(a_S a_H − b_H b_S),
S = (a_H r_S − b_S r_H)/(a_S a_H − b_H b_S).


For a numerical example let r_H = 0.3; r_S = 0.5; a_H = a_S = 0.1; b_H = b_S = 0.05.
Thus if


H is 0 then S = r_S/a_S = 0.5/0.1 = 5,
S is 0 then H = r_H/a_H = 0.3/0.1 = 3.


Otherwise if both H and S > 0,


H = (a_S r_H − b_H r_S)/(a_S a_H − b_H b_S)
  = (0.1 × 0.3 − 0.05 × 0.5)/(0.1 × 0.1 − 0.05 × 0.05) = 0.667,


S = (a_H r_S − b_S r_H)/(a_S a_H − b_H b_S)
  = (0.1 × 0.5 − 0.05 × 0.3)/(0.1 × 0.1 − 0.05 × 0.05) = 4.667.
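The coexistence equilibrium can be checked numerically by substituting it back into the two equilibrium conditions, r_H − a_H H − b_H S = 0 and r_S − a_S S − b_S H = 0. The following is a small sketch of that check; the function name is an illustrative choice.

```python
def coexistence_equilibrium(r_H, r_S, a_H, a_S, b_H, b_S):
    """Return (H, S) at which both species are present and neither basal area changes."""
    det = a_S * a_H - b_H * b_S
    if det == 0:
        raise ValueError("no unique coexistence equilibrium")
    H = (a_S * r_H - b_H * r_S) / det
    S = (a_H * r_S - b_S * r_H) / det
    return H, S


H, S = coexistence_equilibrium(0.3, 0.5, 0.1, 0.1, 0.05, 0.05)
# Residuals of the two equilibrium conditions; both should be essentially zero.
res_H = 0.3 - 0.1 * H - 0.05 * S
res_S = 0.5 - 0.1 * S - 0.05 * H
print(round(H, 3), round(S, 3), res_H, res_S)
```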


This is the only stable equilibrium. At any of the other equilibria just one stray 
seed of a species that is missing from the forest will cause a move to a new 
equilibrium. 

Assuming different combinations of initial basal area values, the succession of 
basal areas will converge to their equilibrium values (Table 15.4). 

Too great a time step may result in negative basal areas. If this happens, take
shorter time steps by replacing Δy with 1/m, where m is an integer > 1. Continue
making m greater until the simulation converges without oscillations (Fig. 15.9).
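The succession of basal areas, including the option of taking m shorter steps per year, can be sketched as follows. This is a minimal illustration using the numerical parameter values of this example; the function name, the choice m = 4, and the 300-year horizon are assumptions made for the sketch, so intermediate values need not match those in Table 15.4.

```python
def simulate_forest(H1, S1, years, m=1,
                    r_H=0.3, r_S=0.5, a_H=0.1, a_S=0.1, b_H=0.05, b_S=0.05):
    """Return lists of H(y) and S(y), y = 1..years+1, using m steps of size 1/m per year."""
    dy = 1.0 / m
    H, S = float(H1), float(S1)
    H_path, S_path = [H], [S]
    for _year in range(years):
        for _step in range(m):
            # Both changes are computed from the same (old) H and S.
            dH = (r_H * H - a_H * H ** 2 - b_H * H * S) * dy
            dS = (r_S * S - a_S * S ** 2 - b_S * H * S) * dy
            H, S = H + dH, S + dS
        H_path.append(H)
        S_path.append(S)
    return H_path, S_path


for start in [(1, 1), (5, 5), (0.5, 5), (5, 0.5)]:
    H_path, S_path = simulate_forest(*start, years=300, m=4)
    print(start, round(H_path[-1], 2), round(S_path[-1], 2))
```

Each of these starting points, in which both species are present, ends near the coexistence equilibrium, while a start with H = 0 stays at H = 0.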
Table 15.4 Various simulations of forest hardwood H and softwood S basal areas given initial conditions

Year     H value    S value
1        0          5
>1       0          5
1        3          0
>1       3          0
1        1          1
8        1.40       3.94
16       1.06       4.42
22       0.92       4.51
30       0.85       4.56
110      0.67       4.665
>110     0.67       4.67
1        5          5
8        1.42       4.16
24       0.88       4.54
46       0.72       4.64
1        0.5        5
8        0.53       4.74
24       0.6        4.7
72       0.66       4.67
         5          0.5
         2.37       2.44
24       1.05       4.43
Fig. 15.9 Velocity plot showing H and S values converging to an equilibrium in a sequence of time steps (axes: H and S; starting points for some simulations are marked)
15.8 Military Battle


Two armies are to engage in battle. The red army enjoys a three-to-one numerical 
superiority, but the blue army is better trained and better equipped. Let R and B 
denote the respective levels of the red and blue armies. The Lanchester model of 
combat states that 


dR/dt = −aB − bRB,
dB/dt = −cR − dRB.


where the parameters a and c are kills by the opposing armies per soldier per day 
and b and d are kills per soldier in both armies from friendly fire per day. The 
first term in each equation accounts for the direct fire (aimed at a specific enemy 
target), and the second term accounts for attrition of army personnel due to its own 
area fire (e.g., artillery), the intensity of which depends on the size of both armies. 
Solving a sequence of difference equations can yield estimates of the size of each 
army over time. For a > c and b > d, R(1) = 3n, B(1) = n, we can see who wins,
i.e., which army population goes to 0 first.


R(t + 1) = R(t) − [a B(t) + b R(t) B(t)] Δt,


B(t + 1) = B(t) − [c R(t) + d R(t) B(t)] Δt.


Assuming Δt = 1, R(1) = 3000, B(1) = 1000, a = 0.004, b = 0.0002, c =
0.002, and d = 0.0001, the sequence of remaining army personnel is shown in
Table 15.5.

One can see that the blue army will need to surrender to the red one if it
wants to have any personnel left alive. This prediction suggests that unless
the values of some of the blue army’s parameters can be made more favorable,
the blue army should not be fighting the red army.


Table 15.5 Remaining army personnel over time

Time period    Red army    Blue army
1              3000        1000
2              2396        694
3              2061        523
4              1844        412
5              1691        333
22             1131        18
23             1127        14
24             1124        11
25             1122        8
26             1121        5
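The battle recursion can be sketched in a few lines. This is an unrounded illustration using the parameter values given above; the function name and the stopping rule (stop when either army reaches zero) are assumptions of the sketch. Table 15.5 appears to keep personnel in whole numbers each period, so later rows may differ slightly from these unrounded values.

```python
def simulate_battle(R1, B1, a, b, c, d, dt=1.0, max_periods=1000):
    """Iterate until one army is eliminated; return lists R(t), B(t) for t = 1, 2, ..."""
    R, B = [float(R1)], [float(B1)]
    while R[-1] > 0 and B[-1] > 0 and len(R) < max_periods:
        r, bl = R[-1], B[-1]
        # Losses in period t depend on both army sizes at the start of the period.
        R.append(max(r - (a * bl + b * r * bl) * dt, 0.0))
        B.append(max(bl - (c * r + d * r * bl) * dt, 0.0))
    return R, B


R, B = simulate_battle(3000, 1000, a=0.004, b=0.0002, c=0.002, d=0.0001)
for t in range(1, 6):
    print(t, round(R[t - 1]), round(B[t - 1]))
winner = "red" if B[-1] <= 0 < R[-1] else "blue" if R[-1] <= 0 < B[-1] else "neither"
print("stopped at period", len(R), "- winner:", winner)
```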
15.9 Disease Epidemic


Consider a population of 70,000 that can catch a disease. The disease is seldom
fatal and leaves the cured victim immune to future infections of this disease.
Infection can only occur when a susceptible person comes in direct contact with
an infectious person. The infectious period for people who get the disease lasts
3 weeks, obviously a less serious disease than COVID-19 (Fig. 15.10).

To develop a discrete simulation model that can estimate the number of sick, 
susceptible, cured, and dead over the course of an epidemic, we need some data, 
and we need to make some assumptions and define some notation identifying 
needed variables and parameters. 

It seems reasonable that the change in the number of infected people is the 
difference between the rate of infection and the rate of being cured or dying. 
The rate of infection will depend on both the number of susceptible people and 
the number of infected, and therefore contagious, people. Both susceptible and 
infected people must exist for the disease to spread. Letting S(t) be the number
of susceptible people at the beginning of period t, and I(t) the number of infected
people at the beginning of period t, then one possible model for predicting the
number of newly infected people in each successive period t, A(t), might be a
function containing the product of S(t) and I(t). This product, S(t) I(t), will ensure
that if either variable value is 0, no new infections, A(t), will occur. The additional
number of people infected in period t is the additional number of people cured
three periods later, assuming there are no deaths.

Assume A(t) = a S(t) I(t). The parameter ‘a’ is a rate coefficient. 


Fig.15.10 This cartoon, 
titled death’s dispensary, is a 
caricature published during 
the London cholera epidemic 
of 1866 (George J. 
Pinwell/Public domain). 
https://www.cbc.ca/news/ 
canada/newfoundland- 
labrador/apocalypse-then- 
conspiracy-theories- 
1.5792105 
Denoting C(t) as the number of cured people at the beginning of period t, and
D(t) the number of dead people at the beginning of period t, where d is the fraction
that die, then at the beginning of period 1,


S(1) = 70,000,
C(1) = 0,
I(1) = 0,
D(1) = 0.


Mass balance requires that 

Number of susceptible at beginning of period t + 1 = S(t + 1) = S(t) − A(t),

Number of newly infected in period t = A(t) = min [a S(t) I(t), S(t)],

Number of infected at beginning of period t + 1 = I(t + 1) = I(t) + A(t) − A(t − 3),

Number of cured at the beginning of period t + 1 = C(t + 1) = C(t) + A(t − 3)(1 − d),

Number of deaths at beginning of period t + 1 = D(t + 1) = D(t) + A(t − 3) d.

As a check, S(t) + C(t) + I(t) + D(t) should always equal S(1) which in this 
example is 70,000. 

Assume that in the first week 28 people got the disease. During the next week 
there were 60 new cases. 

Thus 60 = a(28)(70,000 − 28) and therefore the infection rate constant ‘a’ =
60/[(28)(70,000 − 28)] = 0.3062449E−04.

If no one is expected to die, the death rate fraction d will be 0. Assuming d = 
0, the results of this example simulation for 15 weeks are shown in Table 15.6. 


Table 15.6 Results of the disease simulation model

Time period   Susceptible at        Number infected   Number infected at    Number cured at
              beginning of time t   during time t     beginning of t        beginning of t
t             S(t)                  A(t)              I(t)                  C(t)
1             70,000                28                0                     0
2             69,972                60                28                    0
3             69,912                188               88                    0
4             69,724                590               276                   0
5             69,133                1,775             839                   28
6             67,358                5,269             2,554                 88
7             62,089                14,516            7,634                 276
8             47,573                31,411            21,560                867
9             16,162                16,162            51,196                2,642
10            0                     0                 62,089                7,910
11            0                     0                 47,573                22,427
12            0                     0                 16,162                53,838
13            0                     0                 0                     70,000
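The mass balance equations of this section can be stepped forward as in the following sketch. It assumes, as the text does, that the 28 cases of the first week are given rather than computed, and that the rate constant is estimated from the 60 cases of the second week; the function and variable names are illustrative.

```python
def simulate_epidemic(population=70_000, first_cases=28, second_cases=60,
                      infectious_periods=3, d=0.0, periods=13):
    """Return the rate constant a and dicts S, A, I, C, D keyed by period t."""
    # Rate constant from the second week's cases: second_cases = a * S(2) * I(2).
    a = second_cases / (first_cases * (population - first_cases))
    S, I, C, D, A = {1: float(population)}, {1: 0.0}, {1: 0.0}, {1: 0.0}, {}
    for t in range(1, periods + 1):
        if t == 1:
            A[t] = float(first_cases)            # observed seed cases, not from the formula
        else:
            A[t] = min(a * S[t] * I[t], S[t])    # cannot infect more than are susceptible
        leaving = A.get(t - infectious_periods, 0.0)   # those infected three periods ago
        S[t + 1] = S[t] - A[t]
        I[t + 1] = I[t] + A[t] - leaving
        C[t + 1] = C[t] + leaving * (1.0 - d)
        D[t + 1] = D[t] + leaving * d
    return a, S, A, I, C, D


a, S, A, I, C, D = simulate_epidemic()
print(f"a = {a:.7e}")
for t in range(1, 14):
    print(t, round(S[t]), round(A[t]), round(I[t]), round(C[t]))
```

The sum S(t) + I(t) + C(t) + D(t) can be printed alongside as the check suggested above.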
Fig. 15.11 Plot of progression of susceptible, infected, and cured among a population of 70,000
predicted by the disease simulation model (horizontal axis: time periods 1 to 13; vertical axis: population; series: susceptible S, new infections A, total infected I, cured C)

Figure 15.11 shows a plot of the data in Table 15.6. 

This would be the first step in identifying the effect of various policies for 
reducing the number of people that get infected or that may die. Vaccination, 
if available, various degrees of isolation from other people, protective clothing 
including masks, and travel restrictions are among alternatives that could reduce 
the infection rate constant or the number of susceptible people in a population, or 
the fraction that die, if any. In addition, the total population of susceptible persons 
could vary either randomly or deterministically, such as due to more tourists during 
certain weeks than others. The parameters ‘a’ and ‘d’ could be random. If so, this 
might suggest a Monte Carlo simulation to obtain probability distributions of the 
number of infected and cured people at any time. 
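A Monte Carlo experiment of this kind might be sketched as follows. The uniform ±30% range for ‘a’, the sample size of 1000, the 30-period horizon, and the choice of the peak number infected as the reported statistic are all hypothetical assumptions made only to show the mechanics; the recursion itself is the one defined in this section with d = 0.

```python
import random


def peak_infected(a, population=70_000, first_cases=28, infectious_periods=3, periods=30):
    """Run the discrete epidemic recursion and return the largest number infected."""
    S, I, peak = float(population), 0.0, 0.0
    new_cases = []                       # new_cases[k] holds A(k + 1)
    for t in range(1, periods + 1):
        A = float(first_cases) if t == 1 else min(a * S * I, S)
        new_cases.append(A)
        # People infected three periods ago are no longer infectious.
        leaving = new_cases[t - 1 - infectious_periods] if t > infectious_periods else 0.0
        S, I = S - A, I + A - leaving
        peak = max(peak, I)
    return peak


rng = random.Random(0)                   # fixed seed so the run is repeatable
a_base = 60 / (28 * (70_000 - 28))       # rate constant estimated in the text
# Hypothetical uncertainty: 'a' uniformly within +/-30% of its estimate.
peaks = sorted(peak_infected(a_base * rng.uniform(0.7, 1.3)) for _ in range(1000))
print("median peak infected:", round(peaks[len(peaks) // 2]))
print("5th to 95th percentile:", round(peaks[50]), "to", round(peaks[949]))
```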


Exercises 
1. Bus replacement. 


Every year 5% of the passenger buses in Ithaca need to be replaced because they are
obsolete and no longer meet safety and environmental standards. Current plans
and budget constraints call for the purchase of 10 new buses each year. How many
buses must the bus company have if these rates of change can be sustained? Is this
equilibrium stable?


2. Controlling algal blooms. 


In many lakes algal blooms are an increasing hazard. They are often caused by 
excessive phosphorus, P, entering the lake. 
Consider a small lake having a constant volume V cubic meters. Thus its inflow
Q equals its outflow Q. Currently the mass of phosphorus entering the lake is P kg
per day. The daily rate of phosphorus decay per unit phosphorus mass in the lake is
defined by the decay constant k. Each of these values, V, Q, P, and k, is known.

The daily change, dM/dt, of phosphorus mass, M, in the lake depends on the daily 
mass of phosphorus entering the lake, P, the mass of phosphorus that exits the lake 
in the outflow, QM/V, and the mass of phosphorus that decays in the lake, kM. This 
change in lake phosphorus mass can be written as 


dM/dt = P − QM/V − kM.


(a) Suppose the initial lake nutrient mass at the beginning of day 1, M(1), is 0. Given
a constant mass of phosphorus, P, entering the lake each day beginning in day 1,
show how you could determine the mass of phosphorus, M(t), at the beginning
of each following day t.

(b) Will the phosphorus mass in the lake reach an equilibrium, and if so what is it?

(Express as a function of V, Q, P, and k.)

(c) Suppose the phosphorus entering the lake, P, can be reduced by X percent. This
would cost C(X). How could you define the tradeoff between this cost, C(X),
and the equilibrium phosphorus concentration, M/V, in the lake?


3. Forest sustained yield: 


One measure of the amount of forest growth in the watershed is the basal area of 
trees. This is the cross sectional area of the trunk near the base of the tree. For both 
hardwood and softwood species the increase in basal area per hectare per year is 
directly proportional to the initial basal area of that species. However, this potential 
increase in basal area is reduced by the loss in basal area due to competition from its 
own species and from the other species. 

Let 


H(y)  basal area of hardwoods per hectare at the beginning of year y.
S(y)  basal area of softwoods per hectare at the beginning of year y.
r_t   basal area growth per unit basal area per hectare for species type t.


a_t   basal area loss per unit of basal area of species type t per unit basal area of same
      species per hectare.


b_t   basal area loss per unit of basal area of species type t per unit basal area of
      different species per hectare.
Equations that describe the changes in basal area over time for both tree species can
be written as


dH/dy = r_H H(y) − a_H H(y)^2 − b_H H(y) S(y),


dS/dy = r_S S(y) − a_S S(y)^2 − b_S H(y) S(y).

Assume r_H = 0.3; r_S = 0.5; a_H = 0.1; a_S = 0.1; b_H = 0.05; b_S = 0.05.


If this forest is to be managed in a sustainable way to obtain a constant harvest 
of hardwood and softwood in each year, create a model to determine how much of 
each type of species can be harvested each year depending on the relative value per 
unit basal area of hardwoods compared to that of softwoods. 


4. For the epidemic affecting 70,000 people described in this chapter, develop the 
equations needed to simulate the course of the disease over time, keeping track 
of the number of infected in each week, and the number susceptible and cured or 
immune at the beginning of each week. Carry out the simulation and plot graphs 
of the results over time as was shown in this chapter. 
ABSTRACT


An introduction to various approaches for modeling systems where different 
policy makers and/or stakeholders have different and often conflicting goals 
regarding desired system performance. 


16.1 Introduction 


Rarely do people and organizations have just one goal they are trying to satisfy. 
Furthermore, some of those goals or objectives may conflict with others. There 
may be no plan or policy that everyone will agree is the best. Just what com- 
bination of objective values is considered best will differ depending on who is 
being asked and sometimes when they are asked. Deciding what to do, or which policy
to implement, takes place in a political process. The role of modelers is to inform
the debates that take place in these political decision-making processes. Modelers 
can help identify and evaluate the alternative plans or policies available and define 
the tradeoffs among conflicting stakeholder goals and other measures of system 
performance (Fig. 16.1). 

Given multiple performance criteria measured in different ways, there are a 
variety of modeling approaches that can be used to identify their tradeoffs, if any. 
In this chapter, some ways of including multiple objectives in models are reviewed. 
Multi-criteria or multi-objective analyses are not designed to identify the best solu- 
tion in cases of conflict among these objectives, but only to provide information 
on the tradeoffs between given sets of quantitative performance criteria. Politi- 
cal decisions are likely to be based on qualitative judgements in addition to any 
quantitative information derived from models. They will not be determined by a 
computer or mathematical model, but the political debates that take place prior to 
decision-making can often be guided by the information resulting from analysts 
and their models. 
Fig. 16.1 Modeling assisting stakeholders who want different policies, programs, and outcomes


For example, consider the resource allocation problem introduced and solved 
in previous chapters. Each allocation resulted in net benefits. The objective was to 
find the allocations that maximized the total net benefits obtained from all allo- 
cations. A second objective may be to distribute these maximum net benefits in 
an equitable way. Both objectives are measured in monetary units. Even assuming 
everyone may agree to maximize total net benefits, subject to any environmen- 
tal, ecological, legal, and social constraints, not everyone is likely to agree as to 
how those net benefits should be allocated among all the stakeholders. This could 
lead to a decision that does not maximize net economic benefits but a decision 
that seems more acceptable and fairer to all who are impacted by the allocation 
decision. 

A general multi-objective optimization problem can be viewed as having a vec- 
tor of objectives. Let the vector X represent the set of unknown decision-variable 
values that are to be determined, and Zj(X) be a performance criterion or objective 
function whose value is determined by the values of X. Each possible vector of 
feasible values of the decision variables X represents a plan. Each performance 
criterion or objective j is an indicator of the impact of that plan. If all n objectives 
Zj(X) are to be maximized, the model can be written 


Maximize [Z1(X), Z2(X),..., Zj(X),..., Zn(X)]. 


Subject to all the constraints that must be satisfied. 


The objective is a vector consisting of n separate objectives. The constraints 
define the feasible region of decision variable values. 

A vector optimization representation of a multi-objective problem may be a
concise way of defining a model, but it is not very useful in solving it. Multiple
objective models can be solved only if their multiple objectives can be reduced
to a single objective. Thus, the multi-objective planning problem defined above
cannot, in general, be solved without additional information and some modeling
modifications. There are many ways to do this. This chapter introduces some of
them.


Fig. 16.2 Feasible tradeoff frontier among two maximization objectives showing the maximum
value of one objective given a value of another objective (axes: max Obj(1) and max Obj(2))


16.2 Efficiency Concept 


One of the goals of multi-objective planning is to identify technologically efficient 
tradeoffs among mutually exclusive feasible plans. These are plans that are on the 
tradeoff frontier (e.g., ‘b’ in Fig. 16.2). Feasible plans that are not on this frontier 
(e.g., ‘a’) are inferior in the sense that it is always possible to identify alternatives
that will improve one or more objective values without making others worse.
The goal of multi-objective modelling is the generation of a set of technologically 
feasible and efficient values of all unknown decision variables and objective func- 
tions. An efficient plan is one in which any objective value cannot be improved 
without causing a less desirable value of one or more other objectives. 


16.3 Dominance 


A plan i having multiple decision variable values, Xi, dominates all others if it
results in an equal or superior value for all objectives j, Zj(Xi), and at least
one objective value is strictly superior to that of each other plan. In symbols,
assuming that all objectives j are to be maximized, plan alternative i, denoted as
Xi, dominates if Zj(Xi) ≥ Zj(Xk) for all objectives j and all plans k, and for each
plan k at least one objective j* is strictly better: Zj*(Xi) > Zj*(Xk).

It is seldom that one plan i, Xi, dominates all others. If it does, choose it! It
is more often the case that different plans will be best with respect to
different objectives. Plan h may be best based on one objective, while plan k
may be best based on another objective. However, if there exist two plans k and
h such that Zj(Xk) ≥ Zj(Xh) for all objectives j and, for some objective j*, Zj*(Xk)
> Zj*(Xh), then plan k dominates plan h and plan h can be dropped from further
consideration, at least with respect to the objectives being considered.
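The pairwise dominance test just described can be sketched for a set of discrete plans as follows. The plan names and objective values are hypothetical, chosen so that one plan is dominated; all objectives are assumed to be maximized.

```python
def dominates(z_k, z_h):
    """True if objective vector z_k dominates z_h (all objectives maximized)."""
    return (all(x >= y for x, y in zip(z_k, z_h))
            and any(x > y for x, y in zip(z_k, z_h)))


def non_dominated(plans):
    """Return the names of plans that no other plan dominates."""
    return [name for name, z in plans.items()
            if not any(dominates(other, z)
                       for other_name, other in plans.items() if other_name != name)]


# Hypothetical objective values (Z1, Z2) for three plans.
plans = {"A": (2.0, 8.0), "B": (7.0, 5.0), "C": (1.5, 6.0)}
print(non_dominated(plans))
```

Here plan A dominates plan C, while neither A nor B dominates the other.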

Referring to Fig. 16.3, plan A dominates plan C and hence C can be dropped
from consideration, at least based on the two objectives shown. While plans A
and B are both dominant plans, plan C may be considered best based on other
considerations or objectives not included in the analysis. If some objectives are not,
or cannot be, included in the analysis, inferior plans with respect to the objectives
that are included in the analysis should not be rejected from eventual consideration.
Dominance analysis can only deal with the objectives being explicitly considered.


Fig. 16.3 Plot showing three discrete mutually exclusive plans, A, B, and C, and their two objective
values (axes: Z1(X) and Z2(X))


16.4 Satisficing 


Defining which plans are dominant does not help us decide which among those 
dominant plans may be better than others. Satisficing, illustrated in Fig. 16.4, is 
one approach for selecting the best plan among those being considered. 

Assume all objectives are to be maximized. Satisficing requires that the partici- 
pants in the decision-making process specify a minimum acceptable value for each 
objective. Those alternatives whose objective values do not meet these minimum 
acceptable values are eliminated from further consideration. If only one alterna- 
tive meets these minimum requirements, select it as the best. If no alternatives 
have objective values that meet these minimum requirements, either reduce these 
requirements until an alternative meets one or more of them or enlarge the options 
being considered, i.e., enlarge the system. If multiple alternatives remain, those 
that remain can again be screened by increasing one or more of the minimum 
requirements until only one alternative remains, such as shown in Fig. 16.4. When 
used in an iterative fashion, satisficing can identify what the participants consider 
best of multiple alternative plans or policies. 
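One round of satisficing, followed by a tightening of one requirement, can be sketched as below. The plans, their objective values, and the minimum acceptable values are hypothetical; in practice the participants in the decision-making process would supply and revise the minimums.

```python
def satisfice(plans, minimums):
    """Keep only plans whose every objective meets its minimum acceptable value."""
    return {name: z for name, z in plans.items()
            if all(value >= floor for value, floor in zip(z, minimums))}


# Hypothetical objective values (Z1, Z2), both to be maximized.
plans = {"P1": (3.0, 9.0), "P2": (5.0, 7.0), "P3": (8.0, 4.0), "P4": (6.0, 6.5)}
first = satisfice(plans, minimums=(4.0, 5.0))    # initial requirements
second = satisfice(first, minimums=(5.5, 5.0))   # raise the requirement on Z1
print(sorted(first), sorted(second))
```

If a round leaves no plans, the requirements would be relaxed instead of tightened.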


Fig. 16.4 Plot showing the objective values of multiple plans illustrating the satisficing approach for
selecting the best plan (axes: Z1(X) and Z2(X); marked: minimum acceptable values of the objectives and the best alternative)
Of course, sometimes the participants in the decision-making process will be
unwilling or unable to narrow down the set of available non-inferior plans suffi- 
ciently with the iterative satisficing method. Then it may be necessary to examine 
in more detail the possible tradeoffs among the competing alternatives. 


16.5 Lexicography 


Another simple approach for determining the best alternative is called lexicogra- 
phy. To use this approach, the participants in the decision-making process must 
rank the objectives in order of priority. The plan that is the best with respect to 
the highest priority objective will be the one selected as superior. If there is more 
than one plan that has the same value of the highest priority objective, then among 
this set of preferred plans the one that achieves the highest value of the second 
priority objective is selected. If here too there are multiple such plans, the process 
can continue until there is a unique plan selected. 

Referring to Fig. 16.3, if Z1 is the most important objective, then the best 
decision is B as shown in that figure. If Z2 is the most important objective, then 
the best decision is alternative A. 
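A lexicographic choice among discrete plans amounts to comparing the objective values in priority order. The sketch below uses hypothetical values for three plans, arranged so that B is best on Z1 and A is best on Z2; the function name and the index-based priority list are illustrative.

```python
def lexicographic_best(plans, priority):
    """Pick the plan best on the top-priority objective, breaking ties in priority order."""
    # Tuples compare element by element, which is exactly a lexicographic ordering.
    return max(plans, key=lambda name: tuple(plans[name][j] for j in priority))


# Hypothetical objective values (Z1, Z2), both to be maximized.
plans = {"A": (2.0, 8.0), "B": (7.0, 5.0), "C": (1.5, 6.0)}
print(lexicographic_best(plans, priority=(0, 1)))   # Z1 ranked first
print(lexicographic_best(plans, priority=(1, 0)))   # Z2 ranked first
```

The second objective matters only when two plans tie exactly on the first, which is uncommon when objective values vary continuously.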

This method assumes such a ranking of the objectives is possible. Often the 
relative importance of various objectives may change over time or depend on other 
factors affecting stakeholder or decision-maker opinions. Consider, for example, 
the problem of purchasing an apple or an orange. Assuming you like both apples 
and oranges, which should you buy if you can only buy one? If you know you 
already have lots of apples, but no oranges, perhaps you would buy an orange, 
and vice versa. Hence, the ranking of objectives can depend on the current state 
and needs of those who will benefit from the plan and these states or needs can 
change. 


16.6 Indifference Analysis 


Another method of selecting the best plan is called indifference analysis. To illus- 
trate the possible application of indifference analysis to plan selection, consider a 
simple situation in which there are only two alternative plans (A and B) and two 
planning objectives (1 and 2) being considered. Let Z1A and Z2A be the values 
of the two respective objectives for plan A and Z1B and Z2B be the values of the 
two respective objectives for plan B. This situation can be plotted as shown
in Fig. 16.3, where plan C is not being considered. Comparing plans A and B
can be difficult when each plan is better than the other with respect to one
objective. Indifference analysis can reduce the problem to one of comparing the
values of only one objective.

Indifference analysis first requires the selection of an arbitrary value for one of
the objectives, say Z2*, for objective 2. It is usually a value within the range of
the values Z2A and Z2B, or in a more general case between the maximum and
minimum of all objective 2 values. Next, a value of objective 1, say Z1, must be
selected such that the participants involved are indifferent between the hypothetical
plan that would have as its objective values (Z1, Z2*) and plan A that has as its
objective values (Z1A, Z2A). In other words, Z1 must be determined such that (Z1,
Z2*) is as desirable as or equivalent to (Z1A, Z2A):


(Z1, Z2*) ≈ (Z1A, Z2A).


Next, another value of the first objective, say Z1*, must be selected such that
the participants are indifferent between a hypothetical plan (Z1*, Z2*) and the
objective values (Z1B, Z2B) of plan B:


(Z1*, Z2*) ≈ (Z1B, Z2B).


These comparisons yield hypothetical but equally desirable plans for each actual
plan. These hypothetical plans differ only in the value of objective 1 and, hence,
they are easily compared. If both objectives are to be maximized and Z1 is larger
than Z1*, then the first hypothetical plan yielding Z1 is preferred to the second
hypothetical plan yielding Z1*. Since the two hypothetical plans are equivalent to
plans A and B, respectively, plan A must be preferred to plan B. Conversely, if Z1*
is larger than Z1, then plan B is preferred to plan A.

This process is illustrated in Fig. 16.5. 

This process can be extended to a larger number of objectives and plans, all
of which may be ranked by a single common objective. For example, assume that
there are three objectives Z1i, Z2i, Z3i, and n alternative plans i. Consider any plan
i. A reference value Z3* for objective 3 can be chosen and a value Z1 estimated
such that one is indifferent between (Z1, Z2i, Z3*) and (Z1i, Z2i, Z3i).

The second objective value remains the same as in the actual alternative and in
the hypothetical alternative. Thus, the focus is on the tradeoff between the values
of objectives 1 and 3.

Next, a new hypothetical plan containing a reference value Z2* together with
Z3* can be created and compared with hypothetical alternative (Z1, Z2i, Z3*). The
focus is on the tradeoff between the values of objectives 1 and 2 since the third
objective values are the same. A value of Z1* must be selected such that the
participants are indifferent between (Z1*, Z2*, Z3*) and (Z1, Z2i, Z3*):


(Z1*, Z2*, Z3*) ≈ (Z1, Z2i, Z3*).


Hence, the participants are indifferent between two hypothetical plans that are
both equivalent to the actual one. The last hypothetical plans, (Z1*, Z2*, Z3*), differ
only by the value of the first objective. The plan that has the largest value for
objective 1 will be the best plan. This was identified using only pair-wise comparisons
among multiple objective values.

All n plans can be ranked just based on the value of this single objective.

All the methods presented so far deal with discrete mutually exclusive plans, 
each defined by known discrete values of their decision variables. The remaining 
methods assume these values are unknown but will depend on the relative impor- 
tance of each objective. Objective values are allowed to vary continuously over 
all possible feasible values. The purpose of these methods is to identify efficient 
combinations of objective values, along with their corresponding decision variable 
values, and the tradeoffs among them. 

Two common approaches for identifying non-dominated plans that together 
identify the efficient tradeoffs among all the objectives Zj(X) are the weight- 
ing and constraint methods. Both methods require numerous solutions of a 
single-objective optimization model to generate points on the objective functions’ 
efficiency frontier. 


16.7 The Weighting Method 


The weighting approach involves assigning a relative weight to each objective and 
adding them together. This converts the objective vector to a scalar. This scalar is 
the weighted sum of the separate objective functions. The multi-objective model 
becomes 


Maximize Z = [w1 Z1(X) + w2 Z2(X) + ... + wj Zj(X) + ... + wn Zn(X)].


Subject to all the relevant constraints. 


The non-negative weights, wj, are constants specified by the modeler. 

The values of these weights, wj, can be varied systematically, and the model 
solved for each combination of weight values, to generate a set of technically 
efficient (non-inferior) plans. 

The foremost attribute of the weighting approach is that the tradeoff, or
marginal rate of substitution, of one objective for another at each identified point
on the objective functions’ efficiency frontier depends on the relative weights.
The marginal rate of substitution between any two objectives Zj and Zk, at specified
values of the decision variables X, is


−[dZj/dZk] = wk/wj.


This applies when each of the objectives is continuously differentiable at the 
point X. 

The relative weights can be varied over reasonable ranges to generate a wide 
range of plans that reflect different priorities among the objectives. Alternatively, 
specific values of the weights can be selected to reflect preconceived ideas of 
the relative importance of each objective. The prior selection of weights requires 
value judgments. As analysts, we are not asking decision-makers to give us their 
preferred relative weights or ranking of objectives. It seems unlikely any decision- 
maker would want to do this for a variety of reasons. We as analysts are picking 
various combinations of weights to identify the efficiency frontier among conflict- 
ing objectives. It is then up to the decision-makers to decide what point on this 
frontier represents the best combination of objective values, and hence the best 
decision variable values. 

If each objective value is ‘normalized’ by dividing by its maximum feasible 
value, then the weights can range from 0 to 1 and sum to 1, to reflect the relative
importance given to each objective. Otherwise, if the values of one objective are 
very large compared to the values of another objective, the weight on the lower 
value objective must be much larger than the weight on the higher value objective 
to get any change in the two objective values. 

Fortunately, here we are not concerned with finding the best set of weights, but 
merely using these weights to identify the efficient tradeoffs among the conflicting 
objectives. After a decision is made, the weights that produced that solution might 
be considered the best, at least under the circumstances and at the time when the 
decision is made. They will probably not be the weight values that will apply in 
other places in other circumstances at other times. 

A principal disadvantage of the weighting approach is that it cannot generate 
the complete set of efficient plans unless the efficiency frontier is strictly concave 
(decreasing slopes) for maximization objectives. If the objective value frontier, or 
any portion of it, is convex, as shown in Fig. 16.6, then only the endpoints of 
the convex portions of the efficiency frontier will be identified using the weighting 
method when maximizing. The same limitation applies to concave portions of
efficiency frontiers when minimizing. These limitations are overcome by using the
constraint method.
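
The behavior described above can be illustrated with a small sketch. The five plans and their objective values below are hypothetical, chosen only so that one non-dominated plan (C) lies on a convex portion of the frontier; the function names are likewise illustrative. The sketch sweeps the weights and records which plan maximizes the weighted sum each time.

```python
# Hypothetical plans: name -> (Z1, Z2); both objectives are to be maximized.
PLANS = {"A": (0, 10), "B": (6, 8), "C": (8, 3), "D": (10, 0), "E": (5, 5)}


def dominates(p, q):
    """True if p is at least as good as q in every objective and better in one."""
    return all(a >= b for a, b in zip(p, q)) and any(a > b for a, b in zip(p, q))


def non_dominated(plans):
    """Names of the plans that no other plan dominates."""
    return {name for name, z in plans.items()
            if not any(dominates(other, z) for other in plans.values())}


def weighting_method(plans, steps=20):
    """Maximize w1*Z1 + w2*Z2 for w1 = 0, 1/steps, ..., 1 with w2 = 1 - w1."""
    found = set()
    for i in range(steps + 1):
        w1 = i / steps
        w2 = 1.0 - w1
        best = max(plans, key=lambda n: w1 * plans[n][0] + w2 * plans[n][1])
        found.add(best)
    return found


efficient = non_dominated(PLANS)
by_weights = weighting_method(PLANS)
print("non-dominated plans:", sorted(efficient))
print("found by weighting: ", sorted(by_weights))
print("missed by weighting:", sorted(efficient - by_weights))
```

In this made-up data set plan C is non-dominated, yet no combination of weights selects it, because it lies below the straight line joining its neighbors B and D.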


16.8 The Constraint Method


The constraint method for multi-objective planning can produce the entire set of 
efficient plans for any shape of efficiency frontier assuming there are tradeoffs 
among the objectives. In this method, one objective, say Zk, is maximized subject
to lower limits, Lj, on the other objectives, j ≠ k. The solution of the model,
corresponding to any set of feasible lower limits Lj, produces an efficient alternative
if the lower bounds on the other objective values are binding (Fig. 16.7). 

In its general form, the constraint model is 


Maximize Zk(X). 
Subject to, in addition to the other constraints in the model, 
Zj(X) ≥ Lj for all j ≠ k.


Note that the dual variables associated with the right-hand-side values Lj are 
the marginal rates of substitution, or rates of change, of Zk(X) per unit change in Lj
(or Zj(X) if binding). 
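
A minimal sketch of the constraint method on a discrete set of plans follows. The plans are hypothetical (the same kind of made-up data one might use to test the weighting method), and with discrete plans the lower bound is rarely exactly binding, so ties in Z1 are broken by the larger Z2 to avoid reporting a dominated plan. That tie-breaking rule is an assumption of this sketch, not part of the method as stated above.

```python
# Hypothetical plans: name -> (Z1, Z2); both objectives are to be maximized.
PLANS = {"A": (0, 10), "B": (6, 8), "C": (8, 3), "D": (10, 0), "E": (5, 5)}


def constraint_method(plans, lower_limits):
    """For each lower limit L on Z2, maximize Z1 over the plans with Z2 >= L."""
    found = set()
    for limit in lower_limits:
        feasible = [n for n, z in plans.items() if z[1] >= limit]
        if not feasible:
            continue  # no plan satisfies this lower limit
        # Break ties in Z1 by the larger Z2 so a dominated tie is not reported.
        best = max(feasible, key=lambda n: (plans[n][0], plans[n][1]))
        found.add(best)
    return found


found = constraint_method(PLANS, range(0, 11))
print("found by the constraint method:", sorted(found))
```

Here plan C, which sits on a convex portion of the frontier, is identified when the lower limit on Z2 is between 1 and 3, while the dominated plan E is never selected.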

An efficiency frontier identifying the tradeoffs among conflicting objectives can 
be defined by solving the model for many values of the lower bounds. Just as with 
the weighting method, this can be a tedious job if there are many objectives. If 
there are more than three objectives, all the tradeoffs cannot be plotted. Pair-wise 
tradeoffs that can easily be plotted do not always clearly identify non-dominated 
alternatives. 

The number of solutions to a weighting or constraint method model can be 
reduced considerably if the participants in the decision-making process can iden- 
tify the acceptable ranges of the values of weights or lower limits. However, this 
is not the language of decision-makers. Decision-makers who count on the sup- 
port of each interest group represented by each objective are not likely to specify 
weights that imply the relative importance of those various stakeholder interests. 
In addition, decision-makers should not be expected to know what they may want 
until they know what they can get, and at what cost (often politically as much as 
economically). However, there are ways of modifying the weighting or constraint 
methods to reduce the amount of effort in identifying these tradeoffs as well as the 
amount of information generated that is of no interest to those making decisions. 
This can be done using interactive methods that are discussed shortly. 

The weighting and constraint methods are among many methods available for
generating efficient or non-inferior solutions. The use of methods that generate 
many solutions, even just efficient ones, assumes that once all the non-inferior 
alternatives have been identified, the participants in the decision-making process 
will be able to select the best compromise alternative from among them. In some 
situations, this has worked. However, in many multi-objective planning situations, 
they are not sufficient in themselves. This is because the number of feasible non- 
inferior alternatives is simply too large. Participants in the decision-making process 
will not have the time or patience to examine and evaluate each alternative efficient 
plan. Participants may also need help in identifying which alternatives they prefer, 
and some may prefer ones that are not on any efficiency frontier, as previously 
discussed. 

There are a few methods available for assisting decision-makers in selecting 
their most desirable non-dominated plan. Some of the more common ones are 
described next. 


16.9 Goal Attainment 


The goal attainment method combines some of the advantages of both the weight- 
ing and constraint plan generation methods already discussed. The participants in 
the planning and management process specify a set of goals or targets Tj for each 
objective j and, if applicable, a weight, wj, that reflects the relative importance of 
meeting that goal compared to meeting other goals. If the participants are unable 
to specify these weights, the analyst must select them and then later change them 
based on their reactions to the generated plans (Fig. 16.8). 

The goal attainment method identifies the plans that minimize the maximum 
weighted deviation of any objective value, Zj(X), from its specified target, Tj. The 
problem is to find the values of the decision variables X and objective function 
values that 


Minimize D 
Subject to, in addition to the other constraints in the model, 
wj[Tj - Zj(X)] ≤ D for j = 1, 2, ..., J.

This method of multi-objective analysis can generate efficient or non-inferior
plans by adjusting the weights and targets. If some targets Tj are less than Zj(X), 
some plans generated from goal attainment may be inferior with respect to the 
objective functions being maximized. Again, this model assumes all objectives are 
being maximized. If not, change the terms wj[Tj - Zj(X)] in the constraints to
wj[Zj(X) - Tj].
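
As a rough illustration of the minimax idea, the sketch below applies goal attainment to a hypothetical discrete set of plans rather than to a continuous model. The plans, targets and weights are made-up values, and the function name is illustrative.

```python
# Hypothetical plans: name -> (Z1, Z2); both objectives are to be maximized.
PLANS = {"A": (0, 10), "B": (6, 8), "C": (8, 3), "D": (10, 0), "E": (5, 5)}


def goal_attainment(plans, targets, weights):
    """Return the plan minimizing D = max_j wj * (Tj - Zj), and that value of D."""
    def max_weighted_deviation(z):
        return max(w * (t - v) for w, t, v in zip(weights, targets, z))

    best = min(plans, key=lambda n: max_weighted_deviation(plans[n]))
    return best, max_weighted_deviation(plans[best])


# Equal weights on both targets.
print(goal_attainment(PLANS, targets=(10, 10), weights=(1.0, 1.0)))
# Less weight on falling short of the Z2 target shifts the choice toward Z1.
print(goal_attainment(PLANS, targets=(10, 10), weights=(1.0, 0.25)))
```

Changing the weights or the targets and re-solving is how alternative plans would be generated with this method.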


16.10 Goal-Programming 


Goal-programming methods also require specified target objective values, along 
with relative losses or penalties associated with deviations from these target values. 
The objective is to find the plan that minimizes the sum of all such losses or 
penalties. Assuming for this illustration that all such losses can be expressed as 
linear functions of deviations from target values, and assuming each objective is 
to be maximized, the general goal-programming problem is to 


Minimize Σj [vj Dj + wj Ej].
Subject to, in addition to the other constraints in the model,
Zj(X) = Tj - Dj + Ej for each objective j.


The parameters vj and wj are the penalties (weights) assigned to objective value 
deficits or excesses, as appropriate. The weights and the target values, Tj, can be 
changed to get alternative solutions, or tradeoffs, among the different objectives. 
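
The penalty calculation can be sketched for a hypothetical discrete set of plans as follows. The deficit Dj and excess Ej are computed directly as the positive and negative parts of Tj - Zj, which is what an optimization model with positive penalties would choose; the plans, targets and penalty values are assumptions made for illustration.

```python
# Hypothetical plans: name -> (Z1, Z2); both objectives are to be maximized.
PLANS = {"A": (0, 10), "B": (6, 8), "C": (8, 3), "D": (10, 0), "E": (5, 5)}


def goal_programming(plans, targets, deficit_penalty, excess_penalty):
    """Return the plan minimizing sum_j (vj*Dj + wj*Ej), and that total penalty."""
    def loss(z):
        total = 0.0
        for v, w, t, value in zip(deficit_penalty, excess_penalty, targets, z):
            deficit = max(0.0, t - value)  # Dj: amount by which Zj falls short of Tj
            excess = max(0.0, value - t)   # Ej: amount by which Zj exceeds Tj
            total += v * deficit + w * excess
        return total

    best = min(plans, key=lambda n: loss(plans[n]))
    return best, loss(plans[best])


# Penalize only deficits, equally for both objectives.
print(goal_programming(PLANS, targets=(7, 6),
                       deficit_penalty=(1.0, 1.0), excess_penalty=(0.0, 0.0)))
```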


16.11 Interactive Methods 


Interactive methods allow participants in the decision-making process to explore 
the range of possible decisions without having to generate all of them, especially 
those of little interest to anyone (Fig. 16.9). 

Some iterative methods begin with an obviously inferior solution. Based on a 
series of questions concerning how much more important it is to obtain various 
improvements of each objective, the methods proceed incrementally from that infe- 
rior solution to more improved solutions. The result is either a solution everyone 
agrees is best, or an efficient one where no more improvements can be made in
one objective without decreasing the value of another. 


16.12 Plan Simulation Performance Measures 


The methods outlined above provide a brief introduction to some of the simpler 
approaches available for plan identification and selection. Details on these and 
other potentially useful techniques can be found in many books, some of which 
are devoted solely to this subject of multi-objective planning. Most have been 
described in an optimization framework to focus on those alternatives that are 
considered dominant and efficient. 

This section describes ways of evaluating alternative plans or policies based on 
performance criteria values derived from simulation models. Simulation models of 
systems yield sets of output variable values. These are values of multiple system 
performance criteria, each possibly pertaining to a specific interest and measured 
in its appropriate units. 

There are numerous ways of summarizing sets of output data that might result 
from simulation analyses. Calculating arithmetic or geometric mean values and 
their standard deviations are two ways of summarizing multiple data. Other indi- 
cations of system performance include reliability, resilience, and vulnerability 
measures. 


Reliability 


The notion of reliability requires defining ranges of values of each performance 
criterion or objective that are considered satisfactory and the ranges of values that 
are considered unsatisfactory. The number of simulated values of a performance 
measure in the satisfactory range divided by the total number of simulated values 
is a measure of its reliability. 


Reliability = number of satisfactory values/total number of values. 


Reliability values associated with any objective or performance criterion range 
from 0 to 1. 

Is a system, or model of it, that produces more reliable output over time (e.g., 
the red time series in Fig. 16.10) better than a less reliable (e.g., the green time 
series) system? A reliability measure tells one nothing about how quickly a system
that produces an unsatisfactory output value recovers and returns to producing 
satisfactory values, nor does it indicate how bad an unsatisfactory value might 
be should one occur. It may well be that a system that fails relatively often, but 
by insignificant amounts and for short durations, will be preferable to one whose 
reliability is much higher but when a failure does occur, it is likely to be much 
more severe and take longer to return to a satisfactory state. 

Resilience and vulnerability measures can quantify these recovery and severity
characteristics of a system.


Resilience 


Resilience can be defined as the probability that if a system output value is unsat- 
isfactory, the next value will be satisfactory. It is the probability of having a 
satisfactory value in period t + 1, given an unsatisfactory value in any period 
t. It can be calculated as 


Resilience = [number of times a satisfactory value follows an unsatisfactory value] /
             [number of times an unsatisfactory value occurred].


Resilience ranges from 0 to 1 and is not defined if no unsatisfactory values 
occur in a particular time series. 


Vulnerability 


Vulnerability is a measure of the extent of the differences between the thresh- 
old value, T, that divides values into satisfactory and unsatisfactory ones, and the 
unsatisfactory values. Clearly, this is a probabilistic measure since such deviations 
from the threshold value will differ. Some analysts use expected values, some use 
maximum observed values, and others may quantify vulnerability in terms of a 
probability of exceedance distribution. 

Assuming an expected value measure of vulnerability is to be used: 


Vulnerability[deviation] = [sum of unsatisfactory deviations from threshold T] /
                           [number of times an unsatisfactory value occurred],


Vulnerability[duration] = [sum of failure durations] / [number of failure events].
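
The three measures defined above can be computed from a simulated time series as in the following sketch. The series and threshold are made-up values (not those of any figure in this chapter), and the sketch assumes that values at or above the threshold are satisfactory. A failure event is taken to be an unbroken run of unsatisfactory values.

```python
def performance_measures(values, threshold):
    """Reliability, resilience and expected-value vulnerability of a time series.

    Assumption: a value is satisfactory if it is >= threshold.
    """
    ok = [v >= threshold for v in values]
    n_fail = ok.count(False)
    result = {
        "reliability": ok.count(True) / len(values),
        "resilience": None,               # undefined when no failures occur
        "vulnerability_deviation": None,
        "vulnerability_duration": None,
    }
    if n_fail == 0:
        return result

    # Times a satisfactory value immediately follows an unsatisfactory one.
    recoveries = sum(1 for t in range(len(ok) - 1) if not ok[t] and ok[t + 1])
    # A failure event starts wherever a failure is not preceded by a failure.
    events = sum(1 for t in range(len(ok)) if not ok[t] and (t == 0 or ok[t - 1]))
    shortfall = sum(threshold - v for v, good in zip(values, ok) if not good)

    result["resilience"] = recoveries / n_fail
    result["vulnerability_deviation"] = shortfall / n_fail
    result["vulnerability_duration"] = n_fail / events  # total failure periods / events
    return result


series = [5, 6, 2, 3, 7, 8, 1, 6, 7, 5]   # hypothetical simulated values
measures = performance_measures(series, threshold=4)
print(measures)
```

Note that a series ending in an unsatisfactory value has no following value to count as a recovery, so that final failure lowers the resilience computed this way.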


Fig. 16.11 Two time series of values of a particular performance measure


An Example: 


For an example consider the two hypothetical time series of values of a perfor- 
mance measure shown in Fig. 16.11. They have the same mean, 4.6, and the same 
variance, 7.66. One is just the 180-degree rotation of the other about the mean. 
Hence if the objective being maximized was the mean, or if the objective being 
minimized was the variance, both series would give identical values of those objec- 
tives. However, their reliability, resilience, and vulnerability measures differ. There 
are tradeoffs among them. 

Just looking at Fig. 16.11, we can see that the reliability of the red series is 
70%. The blue series reliability is 90%. 

The resilience of the red series is 33%. The blue series resilience is 100%. If 
vulnerability is based on maximum failure, that of the blue series is greater than 
that of the red series. If vulnerability is based on maximum duration, that of the
red series is greater than that of the blue series. 


Exercises 


1. Determining efficiency frontiers by weighting and constraining multiple objec- 
tives: 

(a) Express the following model in a form used for defining the efficiency frontier
(tradeoff between the two objectives) using the weighting method and the
constraint method.


Maximize Z1 = 4X1 - X2
Maximize Z2 = -2X1 + 6X2
Subject to:

X1 ≤ 4


(b) Plot the efficiency frontiers in decision (x1 vs. x2) and objective (z1 vs. z2)
spaces. 


2. Resource allocation 
Consider again the resource allocation problem where three users obtain benefits 


B(X) from the resources X they get allocated to them. The functions B(X) and their 
maximum values are shown below. 


Function                    Optimal X    Optimal value of function
B1(X1) = 6X1 - X1^2         X1 = 3       B1(3) = 9
B2(X2) = 7X2 - 1.5X2^2      X2 = 7/3     B2(7/3) = 147/18
B3(X3) = 8X3 - 0.5X3^2      X3 = 8       B3(8) = 32


Instead of finding the values of each allocation that maximizes the total benefits, 
assuming only 6 resources are available, each user wants to maximize their own 
benefits. This is now a multi-objective problem. Show how to find the tradeoffs among 
each user using the weighting, constraint, goal attainment and goal-programming 
methods. 


3. Reliability, resilience, and vulnerability performance measures: 


Generate a time series of random variable values from a probability distribution 
you select and for a specified threshold value separating satisfactory values from 
unsatisfactory values determine values of reliability, resilience, and vulnerability. 


4. A multiple objective optimization problem: 


Show how you could use the weighting and constraint and goal attainment methods 
to identify the tradeoff among various maximum values of Z1 and Z2. 


Maximize Z1 
Maximize Z2 
Z1 = 2X,
Z2 = 3Y,
X^2 + Y^2 ≤ 16.


ABSTRACT


The chapter introduces methods of quantifying and modeling qualitative state- 
ments concerning system objectives and constraints and thus enabling the 
analyses of such systems. 


The precise quantification of many system performance criteria and parameter and 
decision values is not always possible. Nor is it always necessary. When the values 
of variables cannot be precisely specified, they are said to be either uncertain or 
fuzzy. If the values are uncertain, probability distributions may be used to quantify 
them. Alternatively, if they are best described by qualitative adjectives, such as dry 
or wet, hot or cold, clean or dirty, and high or low, fuzzy membership functions 
can be used to quantify them. Both probability distributions and fuzzy membership 
functions of these uncertain or qualitative variables can be included in quantitative 
optimization and simulation models. This chapter focuses on fuzzy optimization 
modeling, again for the preliminary screening of alternative possible decisions. 


17.1 Introduction 


Large, small, pure, polluted, satisfactory, unsatisfactory, sufficient, insufficient, 
excellent, good, fair, poor, etc. are adjectives often used to describe various values 
of performance measures of some systems. These descriptors do not have ‘crisp’ 
well-defined boundaries that separate them from other values of the performance 
measures. A particular mix of economic and environmental impacts may be more 
acceptable to some and less acceptable to others. Plan A is better than Plan B. 
The water quality and temperature must be good for swimming. These qualitative 
descriptors convey information despite their imprecision. 

This chapter illustrates how these qualitative descriptors can be quantified and 
used in optimization models. Before this can be done some definitions are needed. 


17.2 Fuzzy Membership Functions


Consider a set A of numbers ranging from say 18 to 25. Thus A = [18, 25]. In 
classical (crisp) set theory any number x is either in or not in the set A. The 
statement ‘x belongs to A’ is either true or false depending on the value of x. The 
set A is referred to as a crisp set. If the limits of set A are uncertain, one may not 
be able to say for certain whether any number x is or is not in the set. The degree 
of truth attached to that statement is defined by a membership function value rather 
than a probability distribution. The value of this function ranges from 0 (definitely
false) to 1 (definitely true) but, unlike a probability distribution, its values need
not sum or integrate to 1 over all values of x. (It could range
from 0 to 10, as suggested in Fig. 17.1). 

Consider the constraint: “The water temperature in a community swimming 
pool should be suitable for swimming.” Just what temperatures are suitable will 
vary depending on the person asked. It would be difficult for anyone to define 
precisely those temperatures that are suitable if it is understood that temperatures 
outside that range are absolutely not suitable. This uncertain range of suitable 
temperatures is called a fuzzy set. Its boundaries are ‘fuzzy.’ 

A membership function defining the interval or range of water temperatures 
suitable for swimming is shown in Fig. 17.2. Such functions may be defined 
based on the responses of many swimmers. There is a zone of imprecision or 
disagreement at both ends of the range. 

The form or shape of a membership function depends on the individual sub- 
jective feelings of the “members” or individuals who are asked their opinions. To 
define this particular membership function, each individual i could be asked to
define his or her comfortable water temperature interval (T1i, T2i). The
membership value associated with any temperature value T equals the number of
individuals who place that T within their range (T1i, T2i), divided by the number
of individual opinions obtained. The assignment of membership values is based 
on subjective judgments, but such judgments seem to be sufficient for much of 
human communication and decision-making. 
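
The polling procedure just described can be sketched in a few lines. The five temperature intervals below are hypothetical responses (in degrees Celsius), and the interval endpoints are treated as inclusive, which is an assumption of this sketch.

```python
# Hypothetical responses: each individual's comfortable interval (T1i, T2i) in deg C.
INTERVALS = [(24, 28), (25, 30), (22, 27), (26, 29), (25, 28)]


def membership(temperature, intervals):
    """Fraction of individuals whose stated interval contains the temperature."""
    inside = sum(1 for low, high in intervals if low <= temperature <= high)
    return inside / len(intervals)


for t in (20, 23, 26, 29):
    print(t, membership(t, INTERVALS))
```

Evaluating this function over a range of temperatures traces out a membership function with zones of disagreement at both ends of the range.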


Fig. 17.2 A fuzzy membership function for suitability of water temperature for swimming


17.3 Optimization in Fuzzy Environments 


Consider the problem of finding the maximum value of x given that x cannot 
exceed 11. This can be written as 


Maximize U = x. 


Subject to : 
x ≤ 11.


The obvious optimal solution, x = 11, is shown in Fig. 17.3. 

Now suppose the objective is to obtain a value of x that is substantially larger 
than 10 while making sure that the maximum value of x should be in the vicinity 
of 11. This is no longer a crisp optimization problem; rather it is a fuzzy one. 

What is perceived to be substantially larger than 10 could be defined by a 
membership function, again representing the results of an opinion poll of what 
individuals think is substantially larger than 10. 


Fig. 17.3 A plot of the crisp optimization problem of maximizing U but it cannot exceed 11


Suppose the membership function for this goal, mG(x), reflecting the results of
such a poll, can be defined as 


mG(x) = 1/{1 + [1/(x - 10)^2]} if x > 10,
mG(x) = 0 otherwise.


This function is shown in Fig. 17.4. 

The constraint on x is that it ‘should be in the vicinity of 11.’ Suppose the 
results of a poll asking individuals to state what they consider to be in the vicinity 
of 11 results in the following constraint membership function, mC(x): 


mC(x) = 1/[1 + (x - 11)^4].


This membership function is shown in Fig. 17.5. 

Recall the objective is to obtain a value of x substantially larger than 10 while
making sure that the maximum value of x should be in the vicinity of 11. In this
fuzzy environment the objective is to maximize the extent to which x exceeds 10
while keeping x in the vicinity of 11. The solution can be viewed as finding the
value of x that maximizes the minimum values of both membership functions.
Thus we can define the intersection of both membership functions and find the
value of x that maximizes that intersection membership function.


Fig. 17.4 Membership function defining the fraction of individuals who think a particular value
of x is ‘substantially’ greater than 10


Fig. 17.5 Membership function representing the vicinity of 11


Fig. 17.6 The intersection membership function and the value of x that represents a fuzzy optimal
decision

The intersection membership function is 


mD(x) = minimum{mG(x), mC(x)}
      = minimum{1/(1 + [1/(x - 10)^2]), 1/(1 + (x - 11)^4)} if x > 10,
      = 0 otherwise.


This intersection set, and the value of x that maximizes its minimum value, is 
shown in Fig. 17.6. 

This fuzzy decision is the value of x that maximizes the intersection member- 
ship function mD(x), or equivalently: 


Maximize mD(x) = maximize the minimum of{mG(x), mC(x) }. 


The optimal solution is x = 11.75 and mD(x) = 0.755 which is the value of 
both membership functions mG(x) and mC(x). 
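
This result can be checked numerically. The sketch below uses the two membership functions given above and relies on one observation that is an assumption of the sketch rather than a statement from the text: between x = 11 and x = 13, mG(x) is increasing and mC(x) is decreasing, so their minimum peaks where the two curves cross, and that crossing can be located by bisection.

```python
def m_goal(x):
    """Membership of 'x is substantially larger than 10'."""
    return 1.0 / (1.0 + 1.0 / (x - 10.0) ** 2) if x > 10.0 else 0.0


def m_constraint(x):
    """Membership of 'x is in the vicinity of 11'."""
    return 1.0 / (1.0 + (x - 11.0) ** 4)


def m_decision(x):
    """Intersection membership function: the smaller of the two memberships."""
    return min(m_goal(x), m_constraint(x))


def fuzzy_optimum(lo=11.0, hi=13.0, tol=1e-9):
    """Bisect for the crossing of m_goal (rising) and m_constraint (falling)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if m_goal(mid) < m_constraint(mid):
            lo = mid   # crossing lies to the right
        else:
            hi = mid   # crossing lies to the left
    x = 0.5 * (lo + hi)
    return x, m_decision(x)


x_opt, m_opt = fuzzy_optimum()
print(round(x_opt, 3), round(m_opt, 3))
```

The values obtained agree with the solution quoted above to the precision given there.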


17.4 Fuzzy Sets in Resource Allocation 


Assume you are employed as a water manager in a state department of conserva- 
tion. You deal with water allocation as well as pollution control policies. 

This water resource allocation problem is illustrated in Fig. 17.7. 

Assume, as in the previous allocation examples, the problem is to find the 
allocations of water to the three firms that maximize the total benefits TB. 


Maximize TB = (6x1 - x1^2) + (7x2 - 1.5x2^2) + (8x3 - 0.5x3^2).


Fig. 17.7 Three firms i that obtain benefits Bi from their allocations xi of water


These allocations cannot exceed the total water available, R. Assuming R = 
6, the crisp optimization problem is to maximize TB subject to the resource 
constraint: 


x1 + x2 + x3 ≤ 6.


The optimal solution is x1 = 1, x2 = 1, and x3 = 4 as previously obtained using
different optimization methods. The maximum total benefits, TB, equal 34.5. 

Instead of assuming the available amount of water is certain to be R = 6, 
assume it is “about 6 units more or less”. This statement defines a fuzzy constraint. 
Assume the membership function describing this fuzzy constraint is defined by 


mC = 1 if R ≤ 5,
mC = [7 - R]/2 if 5 ≤ R ≤ 7,
mC = 0 if R ≥ 7,


as is shown in Fig. 17.8. 

Converting the total benefit function, TB, to a fuzzy function, mG, that ranges
linearly from 0 to 1, reaching 1 at the maximum unconstrained value of TB, 49.17,
the fuzzy optimization problem becomes


Maximize minimum (mG, mC)
or equivalently:


Maximize m


Subject to:


m ≤ mG,
m ≤ mC,
mG = [(6x1 - x1^2) + (7x2 - 1.5x2^2) + (8x3 - 0.5x3^2)]/49.17,
mC = [7 - R]/2,   5 ≤ R ≤ 7,
x1 + x2 + x3 ≤ R.


Fig. 17.8 Membership function for ‘R is about 6 units more or less’


Solving this model to find the maximum of a lower bound m on each of the
two membership functions, the optimal fuzzy decisions are x1 = 0.91, x2 = 0.94,
x3 = 3.81, mC = mG = 0.67, and the total net benefit, TB = 33.1. Compare this
with the crisp solution of x1 = 1, x2 = 1, x3 = 4, and the total net benefit of 34.5.
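
One way to reproduce these numbers without an optimization package is sketched below. It is not necessarily how the solution above was obtained. The sketch assumes that, for any given R, the best allocation equalizes the marginal benefits of the three firms (6 - 2x1 = 7 - 3x2 = 8 - x3), which follows from the first-order conditions of the crisp problem, and then bisects on R between 5 and 7 for the point where mG and mC are equal.

```python
TB_MAX = 9 + 147 / 18 + 32   # sum of the three unconstrained maxima, about 49.17


def allocate(R):
    """Allocation of R units that equalizes marginal benefits (valid for 5 <= R <= 7)."""
    # 6 - 2*x1 = 7 - 3*x2 = 8 - x3 = lam and x1 + x2 + x3 = R give lam directly.
    lam = (40.0 / 3.0 - R) * 6.0 / 11.0
    return (6.0 - lam) / 2.0, (7.0 - lam) / 3.0, 8.0 - lam


def total_benefit(x1, x2, x3):
    return (6 * x1 - x1 ** 2) + (7 * x2 - 1.5 * x2 ** 2) + (8 * x3 - 0.5 * x3 ** 2)


def m_goal(R):
    return total_benefit(*allocate(R)) / TB_MAX


def m_constraint(R):
    return 1.0 if R <= 5.0 else max(0.0, (7.0 - R) / 2.0)


def solve(lo=5.0, hi=7.0, tol=1e-9):
    """Bisect on R: m_goal rises with R while m_constraint falls."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if m_goal(mid) < m_constraint(mid):
            lo = mid
        else:
            hi = mid
    R = 0.5 * (lo + hi)
    return R, allocate(R), min(m_goal(R), m_constraint(R))


R_opt, x_opt, m_opt = solve()
print(round(R_opt, 2), [round(x, 2) for x in x_opt], round(m_opt, 2),
      round(total_benefit(*x_opt), 1))
```

With R fixed at 6 the same `allocate` function returns the crisp solution (1, 1, 4).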


Water pollution control. 


Consider the stream pollution problem illustrated in Fig. 17.9. The stream receives
waste from sources located at sites 1 and 2. Without some waste treatment at
these sites, the pollutant concentrations at sites 2 and 3 will exceed the maximum
acceptable concentration. The problem is to find the level, xi, of wastewater
treatment (fraction of waste removed) at sites i = 1 and 2 required to meet the quality
standards at sites 2 and 3 at a minimum total cost. The data used for the problem
shown in Fig. 17.9 are listed in Table 17.1.


Fig. 17.9 Two firms discharging their wastes W into a river upstream of a park. The problem is
to find the waste removal efficiencies (x1, x2) that result in meeting the stream quality standards at
least cost


Table 17.1 Parameter values selected for the water quality management problem illustrated in
Fig. 17.9

The crisp model for this problem is 


Minimize C1 (x1) + C2(x2). 


Subject to: 
Water quality constraint at site 2: 


[P1 Q1 + W1(1 - x1)] a12 / Q2 ≤ P2max
[(32)(10) + 250000(1 - x1)/86.4] 0.25/12 ≤ 20


which when simplified is x1 ≥ 0.8.
Water quality constraint at site 3:


{[P1 Q1 + W1(1 - x1)] a13 + [W2(1 - x2)] a23} / Q3 ≤ P3max
{[(32)(10) + 250000(1 - x1)/86.4] 0.15 +
[80000(1 - x2)/86.4] 0.60} / 13 ≤ 20


which when simplified is x1 + 1.28 x2 ≥ 1.79.
Restrictions on fractions of waste removal:
0 ≤ xi ≤ 1.0 for sites i = 1 and 2.


Fig. 17.10 Membership function for a maximum concentration of ‘about 20 mg/l.’


For a wide range of reasonable costs, the optimal solution found using linear
programming is x1 = 0.80 and x2 = 0.77 or essentially 80% removal efficiencies
at sites 1 and 2.
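
The arithmetic behind the two simplified constraints can be checked with a short script. The parameter values are read off the numerical constraints above (for example, P1 = 32 and Q1 = 10 from the product (32)(10)); the variable names are chosen for this sketch. The site 2 bound evaluates to about 0.78, which the text rounds to 0.8.

```python
# Parameter values as they appear in the numerical constraints above.
P1, Q1, Q2, Q3 = 32.0, 10.0, 12.0, 13.0
W1, W2 = 250000.0 / 86.4, 80000.0 / 86.4
A12, A13, A23 = 0.25, 0.15, 0.60
P_MAX = 20.0

# Site 2: [P1*Q1 + W1*(1 - x1)] * A12 / Q2 <= P_MAX   =>   x1 >= x1_min
x1_min = 1.0 - (P_MAX * Q2 / A12 - P1 * Q1) / W1

# Site 3: {[P1*Q1 + W1*(1 - x1)] * A13 + W2*(1 - x2) * A23} / Q3 <= P_MAX
# Rearranged and divided by A13*W1:   x1 + coeff * x2 >= rhs
coeff = (A23 * W2) / (A13 * W1)
rhs = (A13 * P1 * Q1 + A13 * W1 + A23 * W2 - P_MAX * Q3) / (A13 * W1)

# Smallest x2 that satisfies the site 3 constraint when x1 = 0.80.
x2_at_x1_080 = (rhs - 0.80) / coeff

print(round(x1_min, 3), round(coeff, 2), round(rhs, 2), round(x2_at_x1_080, 2))
```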

But what if the problem were stated in another way? Suppose the maximum 
allowable pollutant concentrations in the stream at sites 2 and 3 were expressed 
as ‘about 20 mg/l.’ Based on the opinions of individuals about what they consider
to be ‘about 20 mg/l,’ a membership function can be defined. Assume it is as shown in
Fig. 17.10. 

Next, assume that the government environmental agency expects each polluter 
to install best available technology (BAT) or to carry out best management prac- 
tices (BMP) regardless of whether or not this is required to meet stream quality 
standards. Asking experts just what BAT or BMP means with respect to treatment 
efficiencies could result in a variety of answers. These responses can be used to 
define membership functions for each of the two wastewater treatment efficiencies 
in this example. Assume these membership functions for both are as shown in 
Fig. 17.11. 

Finally, assume there is a third concern, one having to do with equity.
It is expected that no polluter should be required to treat at a higher
efficiency, more or less, than the other polluter. A membership function defining just
what differences are acceptable or equitable, could quantify this concern. Assume 
such a membership function is as shown in Fig. 17.12. 

Considering each of these membership functions as objectives to be maximized, 
a fuzzy multi-objective optimization model can be defined. One approach is to 
find the treatment efficiencies that maximize the minimum value of each of these 
membership functions. 


Maximize m = min{mP, mT, mE},


which is equivalent to


Maximize m


where


m ≤ mP,
m ≤ mT,
m ≤ mE.


Fig. 17.11 Membership function defining the waste removal efficiencies associated with the best
available treatment technology or best management practices


Fig. 17.12 Equity membership function in terms of the absolute difference between the two
treatment efficiencies


If we assume that the pollutant concentrations at sites j = 2 and 3 will not 
exceed 23 mg/l, the pollutant concentration membership functions mPj are 


mPj = 1 - p2j/5.
The pollutant concentration at each site j is the sum of two components:


Pj = p1j + p2j,

where
p1j ≤ 18,
p2j ≤ (23 - 18).


Assuming the treatment plant efficiencies will be between 70 and 90% at both 
sites i = 1 and 2, the treatment technology membership functions mTi are


mTi = (x2i/0.05) - (x4i/0.10),


and the treatment efficiencies are
xi = 0.70 + x2i + x3i + x4i
where


x2i ≤ 0.05,
x3i ≤ 0.05,
x4i ≤ 0.10.


Finally, assuming the difference between treatment efficiencies will be no
greater than 0.14, the equity membership function, mE, is


mE = Z - (0.5/0.05) D1 + 0.5(1 - Z) - (0.5/(0.14 - 0.05)) D2,


where


D1 ≤ 0.05 Z,

D2 ≤ (0.14 - 0.05)(1 - Z),

x1 - x2 = DP - DM,

DP + DM = D1 + 0.05(1 - Z) + D2,


Z is a binary 0, 1 variable.


The remainder of the water quality model remains the same: 
Water quality constraint at site 2: 


[P1 Q1 + W1(1 - x1)] a12 / Q2 = P2,
[(32)(10) + 250000(1 - x1)/86.4] 0.25/12 = P2.


Water quality constraint at site 3:


{[P1 Q1 + W1(1 - x1)] a13 + [W2(1 - x2)] a23} / Q3 = P3,
{[(32)(10) + 250000(1 - x1)/86.4] 0.15 + [80000(1 - x2)/86.4] 0.60} / 13 = P3.


Restrictions on fractions of waste removal:
0 ≤ xi ≤ 1.0 for sites i = 1 and 2.


Solving this fuzzy model yields the results shown in Table 17.2. 


Table 17.2 Solution to fuzzy water quality management model


Maximum membership values: 0.93 for all mT and mP, 1.0 for mE

Treatment efficiencies: 0.81

Pollutant concentrations: 18.28 at site 2, 18.36 at site 3
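
These results can be approximated with a simple grid search in place of the mixed-integer model. In the sketch below the shapes of the three membership functions are inferred from the piecewise-linear constraints given above (the figures themselves are not reproduced here), so they should be read as assumptions: pollutant membership falls from 1 at 18 mg/l to 0 at 23 mg/l; treatment membership rises from 0 at 0.70 to 1 at 0.75, stays at 1 to 0.80, and falls to 0 at 0.90; equity membership falls from 1 at a zero difference to 0.5 at 0.05 and to 0 at 0.14.

```python
W1, W2 = 250000.0 / 86.4, 80000.0 / 86.4   # waste load terms from the constraints


def concentrations(x1, x2):
    """Pollutant concentrations at sites 2 and 3 for removal fractions x1, x2."""
    upstream = 32.0 * 10.0 + W1 * (1.0 - x1)
    p2 = upstream * 0.25 / 12.0
    p3 = (upstream * 0.15 + W2 * (1.0 - x2) * 0.60) / 13.0
    return p2, p3


def m_pollutant(p):
    return max(0.0, min(1.0, 1.0 - (p - 18.0) / 5.0))


def m_treatment(x):
    if x <= 0.70 or x >= 0.90:
        return 0.0
    if x < 0.75:
        return (x - 0.70) / 0.05
    if x <= 0.80:
        return 1.0
    return (0.90 - x) / 0.10


def m_equity(d):
    if d <= 0.05:
        return 1.0 - (0.5 / 0.05) * d
    if d <= 0.14:
        return 0.5 - (0.5 / 0.09) * (d - 0.05)
    return 0.0


def overall(x1, x2):
    """Smallest of all membership values for a pair of treatment efficiencies."""
    p2, p3 = concentrations(x1, x2)
    return min(m_pollutant(p2), m_pollutant(p3),
               m_treatment(x1), m_treatment(x2), m_equity(abs(x1 - x2)))


# Search efficiencies from 0.70 to 0.90 in steps of 0.001.
grid = [0.70 + 0.001 * i for i in range(201)]
best_m, best_x1, best_x2 = max((overall(a, b), a, b) for a in grid for b in grid)
best_p2, best_p3 = concentrations(best_x1, best_x2)
print(round(best_m, 2), round(best_x1, 3), round(best_x2, 3),
      round(best_p2, 2), round(best_p3, 2))
```

Because the search is limited to a 0.001 grid, the values it returns agree with Table 17.2 only to within that resolution.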


17.5 Summary 


Optimization models incorporating fuzzy membership functions are sometimes 
appropriate when only qualitative statements are made when specifying objectives 
and/or constraints of a particular problem or issue. This chapter has shown how
fuzzy optimization can be applied in such situations. 




Exercises


1. Consider the problem of heating a swimming pool. You are told to maintain 
the right temperature, T, and not spend too much money, C(T), doing it. How 
might you develop a fuzzy model for determining the ‘best’ temperature and 
cost? Assume you know the cost function C(T). Draw and quantify the member- 
ship functions and develop the optimization model that maximizes the minimum 
membership value. 


2. Water Quality Management Model 
Exercise 7 in Chap. 7 involved finding the ‘least-cost’ amounts of wastewater 
treatment (treatment efficiencies) at sites 1 and 2 that meet stream quality
standards at sites 2 and 3. Currently there is no treatment. All the wastewater is
discharged into the stream. 


[Figure: stream sites and wastewater discharges]

Wastewater discharges: 100 kg/day and 200 kg/day
Current pollutant concentrations: 58 mg/l, 95 mg/l
Maximum allowable concentrations: 18 mg/l, 23 mg/l

Available Data:
Stream flow = 1000 m^3/day at all sites. (1 kg/day)/(1000 m^3/day) = 1 mg/l.
Fraction of waste discharged into stream at site 1 that reaches site 2: 0.25 
Fraction of waste discharged at site 1 that reaches site 3: 0.15 
Fraction of waste at and discharged into stream at site 2 that reaches site 3: 0.60 
Limits of treatment: removal of 30% required, but no more than 90%, for both 
sites. The initial concentration just upstream of site 1 is 32 mg/l. 
Assume the cost of waste removal are 30*fraction removed at site 1 and 
20* fraction removed at site 2. 
Can you find a solution that keeps the stream clean yet doesn’t cost too much? 
ABSTRACT


Concluding thoughts on the successful implementation of modeling in policy 
making processes and the relationships between analysts and policy makers. 


Successfully understanding the methods presented in this book has given you 
some skills in model building and obtaining solutions from these models. How- 
ever, this alone will not necessarily help you apply and implement such tools in 
practice. In addition to modeling skills, systems analysts working within or for 
organizations that make decisions need to know how to effectively inform those 
within those organizations or agencies who make or recommend decisions and 
thus can benefit from modeling designed to identify and evaluate possible alterna- 
tives. This requires building trust, and an awareness of, and being responsive to, 
the often-changing information needs of those who recommend or make decisions 
(Fig. 18.1). 

Analysts, especially those engaged in informing policymakers, need to be good 
communicators. This involves making their results transparent by specifying the 
assumptions upon which the results are based and by addressing the uncertainties 
and alternatives openly, taking into account the different interests, goals, and perspectives
of stakeholders and policymakers. Part of being good communicators
is recognizing that many terms analysts use, such as the word “model,” can mean 
different things to others. Analysts attempting to communicate effectively to others 
should be aware of this need to speak the language their audiences understand. 

What do policymakers expect from analysts? One might think they would like 
definitive advice on what to do, what plan or policy to choose, what action to 
take, and when, backed up by scientific evidence supporting that position. However,
most know that models can, by definition, answer or address only ‘what if?’
analytical questions, not the normative ones. A push for decisive decisions not
only overlooks uncertainty but lies beyond the competence of analysts to deliver
under the label of “science.” Furthermore, analysts working on policy issues can
discover “inconvenient truths,” i.e., model results that might make an otherwise
popular policy undesirable and therefore complicate a policy response or force
a politically sensitive conclusion. Such a situation can cause two problems. One
is the difficulty of communicating unexpected, disturbing results policymakers do
not want to hear, thereby creating difficulties for them and possibly disrupting the
relationship scientists have with them. The other is the dilemma of whether to
make public (publish) such results, which can understandably be motivated by a
sense of responsibility towards the public, as well as to one’s career as an objective
analyst.

Fig. 18.1 Informing the political process is itself a political process. (Cartoon caption: “Guilty: for getting involved in political processes!”)

Informing, i.e., knowing what to present, and how and when, is learned through
collaboration that generates mutual understanding and trust between systems analysts
and their clients. Far less effective are ad hoc modeling results “delivered
by parachute” by an outside expert or firm, either unsolicited or in a rush when
policymakers suddenly ask for modeling results that analysts may or may not have.
This especially applies when a sufficient level of trust has not been developed 
between the analysts and their client policymakers. Useful evidence comes from 
collaborative, continuous, long-term relationships with policymakers and their staff 
throughout a policy making process. This is one reason why there is a tendency for 
policy making agencies to select the same consulting firms to provide the scientific 
evidence desired over time. They have learned to trust them. 

To be relevant to, and imbedded in, policy making processes, analysts must 
build up that trust and be aware of, if not engaged with, the world in which alter- 
native policies and stakeholder values are considered, debated, and where choices 
are made. This is a world where simple opinions and anecdotes coming from 
groups having different interests, perspectives, and power asymmetries, and even 
false information, can influence final decisions. 

Yet policies chosen without sufficient supporting scientific evidence are more 
likely to fall short of being as successful as they could be. An excellent example 
of this is the observation that measures taken to increase the efficiency of water 
used for irrigation so that the savings could be beneficially used elsewhere often 
have just the opposite impact. They simply motivate enlarging the areas irrigated. 
In this case one could argue the policy to increase irrigation efficiency in order to 
provide more water for other uses might have been informed by analyses, but if 
so the analyses were not sufficient. They did not consider the whole system, or 
in fact human behavior. While any policy may result in surprising outcomes not
foreseen when the policy was implemented, the scale and likelihood of adverse
consequences are much higher when decisions are informed by incomplete evidence
or by none at all.
Fig. 18.2 Model outputs by themselves are rarely ready for prime time. Informing policymakers
requires translating those outputs to what is desired and understood by, and relevant to, them.
(Cartoon dialogue: “Warren: Do you think we are ready to show our results?” “Maybe, but not before we make them relevant.”)


This irrigation story highlights the need for an iterative adaptive policy 
modeling—decision-making process. Once analysts start working on identifying 
alternatives, they may realize that they forgot to include some important criteria 
or constraints, requiring them to go back and update their models and data and 
continue through the process again, such as illustrated in Fig. 1.1 in Chap. 1. Each 
of these steps should be done with the decision-maker(s) and the stakeholders, 
ideally in a shared collaborative and open process. 

Part of the art of modeling is deciding what to model, and in what detail. There 
is no reason to think the first attempt will be the right one. Feedback from those 
being informed by the modeling exercise will almost always motivate modifica- 
tions in any systems model. One can only hope that by the time a decision must be 
made, the modeling results have succeeded in promoting the understanding desired 
and needed by those responsible for making decisions (Fig. 18.2). 
Exercise Solutions


1. Analyzing Public Decisions 


1. Why develop and use models? 
Either to better understand the system and how it functions, or for predicting 
the performance of the system under alternative inputs and other assumptions. 
To inform decisions. 
2. Under what conditions is modeling useful to managers (decision-makers)? 
A decision needs to be made. 
There exist many alternatives. 
The best alternative is not obvious. 
The problem or issue is at least partially quantifiable. 
3. What is a measure of modeling success? 
Whether the results of the analyses influenced the debate on what decision to 
take. 
Whether the system and its performance are better understood. 


2. Public Sector Systems 


1. General.
Under what conditions might it be appropriate to apply systems modeling methods?

• An “innovative” agenda has support in a decision-making institution, whether local or national or international.
• The inclusion of stakeholders, i.e., the public, in decision-making is possible and a priority.
• Satisfying stakeholder interests is an institutional goal.
• There is sufficient trust and capacity in government to think outside the box, i.e., to experiment.
• Problems are complex enough to be difficult to address within single disciplinary or institutional silos.
• There exist one or more champions (persons or institutions) committed to leading the study and able to implement change.
• There exists sufficient funding and time and data and expertise to perform the analyses.
2. What is the purpose of developing and using these modeling methods?
To inform the decision-making processes.
To improve one’s understanding of how a system performs.
3. How would you develop a conceptual network representation of the interdependence
among components of water, land, energy, climate, and socio-economic systems?
One example:

[Figure: conceptual network linking energy, water, land, climate, and economic systems. Recoverable link labels: precipitation, thermal cooling, food needs, hydropower, energy demands, wind, solar.]

Calvin KV, P Patel, L Clarke, G Asrar, B Bond-Lamberty, RY Cui, A Di Vittorio, K Dorheim, J Edmonds, C Hartin, M
Hejazi, R Horowitz, G Iyer, P Kyle, S Kim, R Link, H McJeon, SJ Smith, A Snyder, S Waldhoff, and M Wise. 2019.
“GCAM v5.1: Representing the linkages between energy, water, land, climate, and economic systems.”
Geoscientific Model Development 12:677-698, https://doi.org/10.5194/gmd-12-677-2019 (CC BY 4.0)


3. Developing Models 


1. If \(\sum_{j=2}^{4} A(j)\) is A(2) + A(3) + A(4), write out the sum \(\sum_{i=1}^{3} \sum_{j=1}^{i} X_{ij}\).

= X11 + X21 + X22 + X31 + X32 + X33
2. Given that \(\sum_{i=1}^{n}\) represents a sum and \(\prod_{i=1}^{n}\) represents a product of n terms,
what is the value of

\[ \frac{\sum_{i=1}^{3} \prod_{j=1}^{4} (i + j)}{\sum_{k=2}^{6} k} \; ? \]

Numerator = (1 + 1)*(1 + 2)*(1 + 3)*(1 + 4) + (2 + 1)*(2 + 2)*(2 + 3)*(2 + 4)
+ (3 + 1)*(3 + 2)*(3 + 3)*(3 + 4)

= (2*3*4*5) + (3*4*5*6) + (4*5*6*7) = 1320

Denominator = 2 + 3 + 4 + 5 + 6 = 20, so the value is 1320/20 = 66
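
The arithmetic in this solution can be checked with a few lines of Python. This is only a sketch for verification; the variable names are illustrative and not part of the exercise.

```python
from math import prod

# Numerator: for each i in 1..3, multiply (i + j) over j in 1..4, then add the products.
numerator = sum(prod(i + j for j in range(1, 5)) for i in range(1, 4))

# Denominator: the plain sum of k over 2..6.
denominator = sum(range(2, 7))

value = numerator / denominator
print(numerator, denominator, value)
```

Writing the nested sum and product as nested generator expressions mirrors the order in which the terms are expanded by hand.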


3. Construct a conceptual model (a picture or a node-link network) of a multiple 
component system. Then identify what decisions are to be made and potential 
objectives or measures of performance. 

Example solution: A transportation system having multiple ways of traveling 
between where you are and where you want to go. 
A conceptual model showing alternatives. 


Origin Destination 


Known: Cost or time or some other attribute associated with each type of travel 
(e.g., car, rail, bus, air plane) on each link. Decision: Which type and route of travel 
to select. 


4. Define the ‘modeling process’ in your own words.

• Identify system and its components and interrelationships, decision variables, constraints, boundary conditions, input data.
• Establish goals or objectives: performance or evaluation criteria.
• Define relationships between decision variables and objectives, constraints.
• Identify solution procedure and modify model as required.
• Solve model and perform sensitivity analyses of assumptions.
• Modify any previous step and redo remaining steps based on feedback from client or new information.

5. What are the possible sources of uncertainty in any planning or management
model and how can one deal with them?
Sources of uncertainty:
• Model input data.
• Model parameters and their values.
• Model itself.

Dealing with them:
• Perform sensitivity analyses.
• Include probabilities within model.


6. Distinguish between simulation and optimization.

• Simulation: Addresses ‘What if’ questions. What will the values of performance measures be given a system design and operating policy, and the input data?
• Optimization: Addresses ‘What should be’ questions based on system objective and model assumptions. What are the ‘best’ decisions given the objective(s) being maximized or minimized and other assumptions?

Simulation is used for determining system performance associated with specified
values of all model variables and parameters. Optimization is often used
for the preliminary screening of alternatives to determine a set of good decision
variable values that can then be simulated to determine more precisely the
system performance.

7. Identify some pitfalls of modeling.

• Believing the model really reflects the real world and not questioning the results.
• Not addressing the real issues of policymakers or stakeholders, or not providing the information needed when it is needed and at the right level of detail.
• Inadequate calibration, verification.


8. Consider the following five alternative plans for providing for more security 
and better road maintenance. Whatever the units of performance are, they differ. 
Assume the alternative plans are all feasible, i.e., can be implemented but only 
one is to be selected. 


Alternative Security benefits Road maintenance costs 
A 25 30 
B 10 35 
C 20 32 
D 15 21 
E 5 25 


Which alternative would be the best in your opinion and why? Why might a 
decision-maker select alternative E even realizing other alternatives exist that 
can give more security and road maintenance? 
[Plot: security benefits versus road maintenance costs for alternatives A to E.]

The alternatives that are efficient, in that you must give up some of one benefit
to get more of another, are easily seen on a security versus road maintenance plot of
these five alternatives. Alternatives A and D are efficient. Based on these two types
of benefits, the selection would be one of these two alternatives. If B, C, or E
is chosen, clearly other objectives are being considered.
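
The efficiency check described above can also be done without a plot. The sketch below assumes that more security is better and that a lower road maintenance cost is better, which is the reading under which A and D come out as the efficient alternatives; the function names are illustrative.

```python
# (security benefit, road maintenance cost) for each alternative plan
plans = {"A": (25, 30), "B": (10, 35), "C": (20, 32), "D": (15, 21), "E": (5, 25)}

def dominates(p, q):
    """True if plan p is at least as good as q on both measures and better on one.

    Assumption: more security is better and lower maintenance cost is better.
    """
    (sec_p, cost_p), (sec_q, cost_q) = p, q
    no_worse = sec_p >= sec_q and cost_p <= cost_q
    better = sec_p > sec_q or cost_p < cost_q
    return no_worse and better

def efficient(plans):
    """Return the names of plans that no other plan dominates."""
    return sorted(
        name for name, p in plans.items()
        if not any(dominates(q, p) for other, q in plans.items() if other != name)
    )

print(efficient(plans))
```

A plan is dropped as soon as any other plan dominates it, so B and C are eliminated by A, and E is eliminated by D.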


9. Define a mathematical model for finding the dimensions of a cylindrical tank 
that minimizes the total cost of storing a specified volume of maple syrup. 
What are the unknown decision variables? What are the model parameters? 
How would you solve this model? 

Decision variables are radius r and height h. Their values are to be determined. 
Parameters are pi (approximately 22/7), the costs per unit area for side (Cs), 
for top (Ct) and for base (Cb) and the required Volume. Their values are known. 


Model:
Minimize Totalcost (‘Totalcost’ is an unknown variable.)
Subject to:
Totalcost = Sidecost + Topcost + Bottomcost (all unknown variables.)
Sidecost = Cs 2(pi) r h
Topcost = Ct (pi) r^2
Bottomcost = Cb (pi) r^2
(pi) r^2 h ≥ Volume


The number of variables in this model can be reduced by combining the 
first four definitional constraints so just r and h are the unknown variables. 
These additional variables and constraints are added for clarity, especially in 
interpreting the model output. 
One way to find a good solution is to assume an initial r and h that does
not provide the required volume. Next define an increment of r, Δr, and an
increment of h, Δh. Then determine which increment to add to the total r or
h already existing that has the greatest volume increase per cost increase,
ΔVolume/ΔTotalcost. Continue until the required volume is obtained. Alternatively,
one can add incremental values of r and h to keep the side cost as
close to 2/3rds of the total cost as possible, but this fact is not generally known.
Again, keep adding increments until the volume constraint is satisfied.
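
For readers comfortable with calculus, the 2/3 rule mentioned above can be checked directly. The sketch below is not the incremental method described in the text: it substitutes h = Volume/((pi) r^2) into the total cost and sets the derivative with respect to r to zero. The cost and volume values are hypothetical and chosen only for illustration.

```python
import math

def tank_design(volume, cs, ct, cb):
    """Least-cost radius and height of a cylinder holding `volume`.

    Substituting h = volume / (pi r^2) into the total cost gives a function of r
    alone; setting its derivative to zero yields r^3 = cs*volume / ((ct + cb)*pi).
    """
    r = (cs * volume / ((ct + cb) * math.pi)) ** (1.0 / 3.0)
    h = volume / (math.pi * r ** 2)
    return r, h

def costs(r, h, cs, ct, cb):
    """Side, top, and bottom costs, matching the model's definitional constraints."""
    side = cs * 2 * math.pi * r * h
    top = ct * math.pi * r ** 2
    bottom = cb * math.pi * r ** 2
    return side, top, bottom

# Hypothetical data: 100 volume units, unit-area costs of 2 (side), 1 (top), 3 (base).
r, h = tank_design(100.0, cs=2.0, ct=1.0, cb=3.0)
side, top, bottom = costs(r, h, 2.0, 1.0, 3.0)
total = side + top + bottom
print(round(r, 3), round(h, 3), round(side / total, 3))
```

Whatever positive costs are used, the side cost share at the optimum comes out as 2/3, which is the fact the incremental rule relies on.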


4. Modeling Examples and Solutions 


1. As the supervisor of a town, you are responsible for allocating money to dif- 
ferent public agencies serving the town. The allocations have been based on 
political, not economic, criteria. Each agency is expecting to get at least what 
they got last year, but because of the loss of tax revenue, you do not have as 
much money to distribute as you did before. 

(a) State what you think would be a fair way to allocate the limited funds you
have. In other words, what would be your criterion for allocating funds that
you could defend at a public hearing?

Possible objectives:

Minimum sum of squared deficit deviations.
Minimum sum of percentage deficits.
Minimum maximum deficit.
Minimum maximum percentage deficit.
Minimum weighted sum of values of above criteria or components of any criterion.


There could be more. 


(b) Develop a model that when solved would identify the allocations that meet
your objective. Clearly define the variables and parameters you use, and the
objective function and constraints.

Let A(i) be the allocation to agency i. (unknown)
Let T(i) be the “target” allocation each agency i expects or wants. (known)

Minimize \(\sum_i (T(i) - A(i))^2\) or \(\sum_i (T(i) - A(i))/T(i)\) or \(\sum_i [(T(i) - A(i))/T(i)]^2\)
or Minimize Maximum over i of \((T(i) - A(i))\) or \((T(i) - A(i))/T(i)\)

Budget constraint: \(\sum_i A(i) \le B\); B is the known available budget.

In this case, since the total desired amount (sum of all T(i)) is > B,
the constraint could be written as \(\sum_i A(i) = B\).


2. Blueberries 


There are three farmer’s markets that sell organically and locally grown blueber- 
ries. The farmer who grows these blueberries gets 90% of the income from their 
sales; the markets get the other 10%. The demand for blueberries differs at each 
market. Some smart economist has determined that the demand (unit price) functions
for blueberries at the three markets (m = 1, 2, 3) are 6/(1 + Q1), 7/(1 + 1.5Q2),
and 8/(1 + 0.5Q3), respectively.

[Figure: demand functions for blueberries, unit price versus Qm.]

At each market m the unit price varies each week depending on the amount 
of blueberries available, Qm, to be sold. How should the farmer distribute a crop 
ranging from 1 to 6 bushels of blueberries each week to maximize the total amount 
of income received from all three markets? 


(a) Construct an optimization model and solve it using the hill climbing method, 
assuming integer bushel allocations. Identify the best distribution of 1 to 6 
bushels. 

(b) Based on the results of this hill climbing method sketch a maximum revenue 
function for the farmer based on the total amount of blueberries available to 
send to the three markets. 
Solution:

(a) Max TR, TR = TR1 + TR2 + TR3
TR1 = 6Q1/(1 + Q1), TR2 = 7Q2/(1 + 1.5Q2), TR3 = 8Q3/(1 + 0.5Q3)
Q1 + Q2 + Q3 ≤ 6

[Table: revenue obtainable at each market for 0 to 6 bushels of blueberries. The three revenue functions are concave.]

Max Revenue Distribution

Total bushels   Q1   Q2   Q3    TR
      1          0    0    1    5.3
      2          1    0    1    8.3
      3          1    1    1   11.1
      4          1    1    2   13.8
      5          1    1    3   15.4
      6          1    1    4   16.5

At each step, pick the largest difference between the above total revenues (i.e., the largest
marginal revenue gain) when adding to previous allocations.

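(b) Plot TR vs total bushels (sketch). 90% of what is shown on the vertical axis goes to the farmer.

[Plot: maximum total revenue TR versus total bushels, through the points (0, 0), (1, 5.3), (2, 8.3), (3, 11.1), (4, 13.8), (5, 15.4), (6, 16.5).]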


(c) How would the integer allocation of 6 bushels differ if the overall objective 
were to maximize the total income from all three markets while keeping their 
individual market incomes as close to being the same as possible? 


Q1 = 2 for TR1 = 4. Q2 = 3 for TR2 = 3.8. Q3 = 1 for TR3 = 5.3. Total $ = 13.1
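
The hill-climbing allocation in part (a) can be sketched in Python as follows. The revenue functions are the ones given in the exercise; the function names and the dictionary layout are illustrative choices. Adding one bushel at a time works here because each revenue function is concave, so marginal gains only shrink.

```python
def revenue(market, q):
    """Total revenue at a market: quantity times the unit-price (demand) function."""
    if market == 1:
        return 6 * q / (1 + q)
    if market == 2:
        return 7 * q / (1 + 1.5 * q)
    return 8 * q / (1 + 0.5 * q)

def hill_climb(total_bushels):
    """Allocate one bushel at a time to the market with the largest marginal revenue."""
    q = {1: 0, 2: 0, 3: 0}
    history = []
    for _ in range(total_bushels):
        gains = {m: revenue(m, q[m] + 1) - revenue(m, q[m]) for m in q}
        best = max(gains, key=gains.get)
        q[best] += 1
        total = sum(revenue(m, q[m]) for m in q)
        history.append((dict(q), round(total, 1)))
    return history

for allocation, total in hill_climb(6):
    print(allocation, total)
```

Each entry in the returned history corresponds to one row of the maximum revenue distribution table, so the same run also gives the points needed for the sketch in part (b).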
3. Suppose you wish to minimize flood risks in two towns. Flood risk is measured
in expected property damage. You have $2 million to spend on flood risk reduction.
Construct an optimization model and solve it to determine where to spend
the $2 million that maximizes total reduction using the hill climbing method.

Investment, $10^6     Total reduced risk
                      Town A    Town B
        1               12        18
        2               22        27

Let Ra(A) be the reduction associated with an investment of A to Town A.
Let Rb(B) be the reduction associated with an investment of B to Town B.
Maximum investment = 2.
Maximize Ra(A) + Rb(B) subject to A + B ≤ 2.
Hill climbing: Assume integer allocations of 1 and 2 × 10^6.
First million to B (since 18 > 12).
Second million to A (since 12 > 9).
Total reduction = 30%


5. Models for Managing Money

1. What is $1 invested today at 7% per year, compounded annually, worth at the
end of 10 years?
About $2. An investment doubles about every 10 years at 7% per year. Assumes no taxes.
2. How long will it take to double your investment if it is earning 10% per year?
About 7 years. Assumes no taxes.
3. What is the value of $1 invested for a year if compounded at 1% per month?

FV = $1(1 + 0.12/12)^12 = $1.1268 if no taxes.


4. What would be the answer to the previous question if an annual nominal interest 
rate of 12% were compounded continuously within the year? 


FV = $1 e^0.12 = $1.1275 if no taxes.
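
The four compounding answers above can be reproduced with the sketch below. It simply evaluates the standard compound-interest expressions; the variable names are illustrative.

```python
import math

# Item 1: $1 at 7% per year, compounded annually, for 10 years.
fv_10_years = 1.0 * (1 + 0.07) ** 10

# Item 2: years needed to double at 10% per year, from (1.10)^n = 2.
years_to_double = math.log(2) / math.log(1.10)

# Item 3: 12% nominal annual rate compounded monthly (1% per month).
fv_monthly = 1.0 * (1 + 0.12 / 12) ** 12

# Item 4: the same nominal rate compounded continuously.
fv_continuous = 1.0 * math.exp(0.12)

print(round(fv_10_years, 4), round(years_to_double, 2),
      round(fv_monthly, 4), round(fv_continuous, 4))
```

The first two results show how rough the "about $2" and "about 7 years" answers are: the exact values are a little under $2 and a little over 7 years.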


5. Suppose after you graduate and begin receiving an income you start investing
$6000 at the end of each year into a tax-free retirement account that earns 8%
per year. You do this for only 10 years, and then just leave it in the account
earning 8% interest each year for the next 30 years when you decide to retire.
Alternatively, you only start investing $6000 per year into this tax-free account
in the 11th year of employment and keep investing annually for the remaining
30 years. Which investment strategy will result in a higher retirement fund at 
the end of 40 years of employment? 


First = [6000((1 + 0.08)^10 - 1)/0.08](1.08)^30 = $874,639.80
Second = 6000((1 + 0.08)^30 - 1)/0.08 = $679,699.30
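
A short sketch of the same comparison, assuming end-of-year deposits as stated in the exercise. The helper name `future_value_of_annuity` is illustrative.

```python
def future_value_of_annuity(payment, rate, years):
    """Value, just after the last deposit, of equal end-of-year deposits."""
    return payment * ((1 + rate) ** years - 1) / rate

rate = 0.08

# Strategy 1: deposit for 10 years, then let the balance grow for 30 more years.
first = future_value_of_annuity(6000, rate, 10) * (1 + rate) ** 30

# Strategy 2: make no deposits for 10 years, then deposit for the last 30 years.
second = future_value_of_annuity(6000, rate, 30)

print(round(first, 2), round(second, 2))
```

The first strategy deposits only a third as much money in total, yet ends higher because its deposits compound for much longer.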


6. How much money are you going to need when you retire to assure you can meet 
your standard of living for the remainder of your life? Specify all the assump- 
tions you are making, considering taxes and inflation. How are you going to 
get that amount of money (i.e., your savings plan)? 

Estimate money needed now to meet standard of living and inflate it to retire- 
ment age. Find present value of amount needed at retirement age to be able to 
withdraw this amount, after-taxes, each year for your estimated remaining life. 
Propose your plan for obtaining this amount of savings needed at retirement 
age. 

7. One criterion for plan selection is the one that produces the maximum net 
annual benefits. The maximum benefit-cost ratio, or annual benefits divided by 
annual costs, is another criterion. Benefit—cost ratios should be no less than one 
if the annual benefits are to exceed the annual costs. Consider two projects, I 
and II: 


                       Project I   Project II
Annual benefits           20          2
Annual costs              18          1.5
Annual net benefits        2          0.5
Benefit-cost ratio         1.11       1.3


What additional information is needed before one can determine which project is 
the most economical project? 

If there are funds available for the more expensive project, then the return from 
investing the remaining funds if the cheaper project be selected must be known before 
either project can be identified as the preferred one. Annual benefit-cost ratios, or 
net benefits, can be used interchangeably to evaluate alternative investment plans 
only if the total amounts of money available are the same. If they are the same, the 
plan having the largest benefit-cost ratio will also have the largest net benefits. 

In this case, project II costs 1.5 of the 18 available so you have 16.5 left over and 
that plus the 2 annual benefits earned is 18.5 total annual benefits. The b/c ratio 
is 18.5/18 = 1.03, not 1.3. Thus, both the net benefits and benefit cost ratios are 
consistent. Project I is preferred. 


8. Bonds are often sold to raise money for infrastructure investments. Each bond 
is a promise to pay a specified amount of interest, usually semiannually, and 
to pay the face value of the bond at some specified future date. The selling
price of a bond may differ from its face value. Since the interest payments are 
specified in advance, the current market interest rates dictate the purchase price 
of the bond. 


Consider a bond having a face value of $10,000, paying $500 annually for 
10 years. The bond or “coupon” interest rate based on its face value is 500/10,000, 
or 5%. If the bond is purchased for $10,000, the actual interest rate paid to the 
owner will equal the bond or “coupon” rate. But suppose that one can invest money 
in similar quality (equal risk) bonds or notes and receive 10% interest. If this is 
possible, the $10,000, 5% bond will not sell in a competitive market. To sell it, its 
purchase price must be such that the actual interest rate paid to the owner will be 
10%. In this case, what is the bond currently worth? 


Solution:

\[ \$6927 = \$500 \left[ \frac{(1.10)^{10} - 1}{0.10\,(1.10)^{10}} \right] + \$10{,}000 \left[ \frac{1}{(1.10)^{10}} \right] \]
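
The bond value can be checked numerically as the present value of the ten $500 payments plus the present value of the $10,000 face value, both discounted at the 10% market rate. The function below is an illustrative sketch, not a general bond-pricing tool.

```python
def bond_price(face_value, coupon, market_rate, years):
    """Present value of the annual coupons plus the face value paid at maturity."""
    growth = (1 + market_rate) ** years
    annuity_factor = (growth - 1) / (market_rate * growth)
    return coupon * annuity_factor + face_value / growth

price = bond_price(face_value=10_000, coupon=500, market_rate=0.10, years=10)
print(round(price, 2))
```

Calling the same function with a market rate of 0.05 returns the face value, which matches the statement that a bond bought at $10,000 pays its coupon rate.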


The interest paid by some bonds, especially municipal bonds, may be exempt 
from state and federal income taxes. If an investor is in the 30% income tax 
bracket, for example, a 5% municipal tax-exempt bond is equivalent to about a 7% 
taxable bond. This tax exemption helps reduce local taxes needed to pay the inter- 
est on municipal bonds, as well as providing attractive investment opportunities to 
individuals in high tax brackets. 


9. Assume a particular university’s tuition and fees are $C today.

Assume the after-tax interest rate you can earn in the next 24 years is 5%.
Assume the inflation rate of tuition and fees in the next 24 years will be 4%.
Show how to determine how much money would be enough to invest today to
pay for four years of tuition and fees starting at the beginning of 20 years from
now.

Just set up the equations needed to find the answer. (Drawing a picture may
help.)

One Solution:

[Timeline figure: year 0 (today) and years 19 to 23.]

Amount needed at end of year 19:
$C[(1 + 0.04)^19 + ((1 + 0.04)^20)/(1 + 0.05) + ((1 + 0.04)^21)/(1 + 0.05)^2
+ ((1 + 0.04)^22)/(1 + 0.05)^3]
Discount this amount 19 years at 5% using (1 + 0.05)^-19.
Thus the money needed today:
{$C[(1 + 0.04)^19 + ((1 + 0.04)^20)/(1 + 0.05) + ((1 + 0.04)^21)/(1 + 0.05)^2
+ ((1 + 0.04)^22)/(1 + 0.05)^3]}/(1 + 0.05)^19
Equivalently:
Present value = $C[((1 + 0.04)/(1 + 0.05))^19
+ ((1 + 0.04)/(1 + 0.05))^20 + ((1 + 0.04)/(1 + 0.05))^21
+ ((1 + 0.04)/(1 + 0.05))^22]
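
The two forms of the answer can be compared numerically. The sketch below evaluates both for $C = 1, so the result is the multiple of today's tuition that must be invested; the function and argument names are illustrative.

```python
def tuition_fund(c, tuition_inflation=0.04, interest=0.05,
                 first_payment_year=19, payments=4):
    """Amount to invest today to cover tuition due at the ends of years 19 to 22."""
    # Two-step version: value the four payments at the end of year 19, then discount.
    at_year_19 = sum(
        c * (1 + tuition_inflation) ** (first_payment_year + k) / (1 + interest) ** k
        for k in range(payments)
    )
    two_step = at_year_19 / (1 + interest) ** first_payment_year

    # One-step version: discount each inflated payment directly to today.
    ratio = (1 + tuition_inflation) / (1 + interest)
    one_step = sum(
        c * ratio ** t
        for t in range(first_payment_year, first_payment_year + payments)
    )
    return two_step, one_step

two_step, one_step = tuition_fund(c=1.0)
print(round(two_step, 4), round(one_step, 4))
```

Because the interest rate is only slightly above the tuition inflation rate, the amount needed today is not much less than four times the current tuition.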


10. You must pay back a debt, say of $1000, with interest, in 12 equal end-of-month
payments. Each monthly payment contains both some of your debt and
the monthly interest owed on the remaining debt. The bank tells you the annual 
interest rate is 5%. Describe how you could determine the annual interest rate 
you actually paid on the debt you owed. 


Solution. 


Compute the twelve equal monthly payments, A, given a present value of $1000. 
Use the monthly interest rate i = 0.05/12 


1000 = A[1/(1 + i) + 1/(1 + i)^2 + ... + 1/(1 + i)^12] = A[((1 + i)^12 - 1)/(i(1 + i)^12)]


Total interest paid is the sum of all 12 payments A less the debt of 1000.
The annual interest rate r that converts 1000 to the sum of all A values satisfies
1000(1 + r) = sum of all A values.


Alternatively : 
Divide the sum of monthly payments by the principal, $1000, and subtract 


1 from that value to compute the actual effective annual interest rate. 


For this example:
A = 85.61 (end-of-period payments)
Sum of all A = 12*A = 1027.32
Effective interest rate: 0.02728978
= 2.73% annual interest rate, or 27.32 total interest paid.


Note: If you paid off the entire debt at the beginning of the year, your interest
payments would be 0 since you have no debt over time. If you waited to the end of
the year your interest payment would be $50. Since you are paying off the debt
throughout the year the total interest paid would be just over half the difference
between $0 and $50.
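
The monthly payment and the rate actually paid on the original debt can be computed as sketched below. The code keeps the unrounded payment, so the total interest comes out a few cents below the figure obtained from the payment rounded to 85.61. The function name is illustrative.

```python
def level_payment(principal, rate_per_period, periods):
    """Equal end-of-period payment that repays the principal with interest."""
    growth = (1 + rate_per_period) ** periods
    return principal * rate_per_period * growth / (growth - 1)

principal = 1000.0
payment = level_payment(principal, 0.05 / 12, 12)
total_paid = 12 * payment
interest_paid = total_paid - principal
rate_on_original_debt = total_paid / principal - 1

print(round(payment, 2), round(interest_paid, 2), round(rate_on_original_debt, 4))
```

The result is well below the quoted 5% because the full $1000 is owed only during the first month.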


11. You are considering taking flying lessons that if begun today will cost $10,000. 
Alternatively, you could wait a year to begin the lessons after paying the fee 
(that is likely to be higher) at that time. 

(a) If you decide to wait a year and invest the $10,000 during the year, earning 
an annual interest rate i, describe how would you determine the extra 
money you would have at the end of the year after paying the inflated 
cost of lessons at that time? 


After investing for a year, the 10,000 will become 10,000(1 + i). 

Inflated cost of flight instruction is 10000(1 + f) where f is the annual rate of 
inflation. 

The extra money you will have is 10,000[(1 + i) - (1 + f)] = 10,000(i - f).

Alternatively: 

Computing the difference in current dollar value using the real (uninflated) rate 
of return r: 

Difference is 10,000(1 + r) - 10,000 and since (1 + i) = (1 + r)(1 + f), the
difference in current dollar values is 10,000(1 + i)/(1 + f) - 10,000.

This difference expressed in beginning of year I dollars is 

[10,000(1 + i)/(1 + f) - 10,000](1 + f) = 10,000[(1 + i) - (1 + f)] = 10,000(i - f)


(b) Assume you forgot to consider the fact that you will owe income taxes on the 
interest earned. Your income tax rate is t. How would your solution change if 
you include the tax payment? 


Solution: Replace each ‘i’ with ‘i(1 - t)’.


12. You must pay back a bank debt, say of $1000, with interest, in 3 equal end-of- 
year payments. Each payment contains the interest on the debt at the beginning 
of the year and some of the principal. 


(As the debt decreases so do the interest payments in each successive A. The 
interest paid, Iy, at the end of a year y is based on the debt, Py, at the beginning 
of that year.) 

The bank tells you the annual interest rate is 5%. 

Show how to compute the principal and interest contained in each of the three 
end-of-year payments ‘A’ using the following steps: 


(a) Write the equation for solving for payments A:
Compute the three equal annual payments, A, given a present value of $1000.
Use the annual interest rate i = 0.05.

1000 = A[1/(1 + i) + 1/(1 + i)^2 + 1/(1 + i)^3] and solve for A
(b) Show the equation for computing the first interest payment, I1:
I1 = 1000(i)
(c) Given A and I1, show the equation for computing the remaining debt at the
beginning of the 2nd year, P1:
P1 = 1000 - (A - I1)
(d) Show the equation for computing the interest paid in the 2nd payment:
I2 = P1(i)
(e) Given A, P1 and I2, solve for the remaining debt at the beginning of the 3rd year:
P2 = P1 - (A - I2)
(f) You can deduct 30% of the annual interest payment from your income tax each
year. Given all the interest payments Iy and A, show the equation you could
use to compute the actual interest rate you are paying on your debt.

1000 = (A - 0.3 I1)/(1 + i) + (A - 0.3 I2)/(1 + i)^2 + (A - 0.3 I3)/(1 + i)^3
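
Steps (a) through (f) can be traced with the sketch below. The bisection search in `after_tax_rate` is one assumed way of solving the equation in (f) for i; the exercise itself only asks for the equation. All function names are illustrative.

```python
def amortize(principal, rate, years):
    """Split each equal end-of-year payment A into interest and principal."""
    growth = (1 + rate) ** years
    payment = principal * rate * growth / (growth - 1)
    debt = principal
    rows = []
    for year in range(1, years + 1):
        interest = debt * rate           # interest on the debt at the start of the year
        repaid = payment - interest      # the rest of A reduces the debt
        debt -= repaid
        rows.append((year, interest, repaid, debt))
    return payment, rows

def after_tax_rate(principal, payment, rows, deduction=0.30):
    """Bisection for the rate i that discounts the after-tax payments to the principal."""
    def present_value(i):
        return sum((payment - deduction * interest) / (1 + i) ** year
                   for year, interest, _, _ in rows)
    low, high = 0.0, 1.0
    for _ in range(100):
        mid = (low + high) / 2
        if present_value(mid) > principal:
            low = mid    # present value too large, so the rate must be higher
        else:
            high = mid
    return (low + high) / 2

payment, rows = amortize(1000.0, 0.05, 3)
rate = after_tax_rate(1000.0, payment, rows)
for year, interest, repaid, debt in rows:
    print(year, round(interest, 2), round(repaid, 2), round(debt, 2))
print(round(payment, 2), round(rate, 4))
```

The after-tax rate comes out as 70% of the quoted rate, since deducting 30% of every interest payment is the same as paying 0.7 × 5% on the outstanding debt.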


6. Solving Models Using Excel

1. Regression involves finding functions that best fit some observed data. One
criterion is to minimize the sum of squared deviations between observed and predicted
values. Suppose you have a set of observed (known) x, y values, say x(i)
and corresponding y(i).

y(i): 4 10 18 11 22 7 10 14 19 3
x(i): 2 4 8 6 10 3 5 7 9 1

Define and solve an optimization model to determine the parameters of a nonlinear
function y = a + bx^c that best fits the above data.


Minimize \(\sum_i [y(i) - (a + b\,x(i)^c)]^2\) to find the best values of a, b, c if non-linear.
If linear, c = 1 and the values of ‘a’ and ‘b’ will differ. This optimization will define
the parameters ‘a’, ‘b’, and ‘c’ of the function y = a + bx^c.

Spreadsheet solution: a = 2.044862, b = 1.1035, c = 1.252016

Sample   y(i)   x(i)   a+bX^c      y-(a+bX^c)   Squared
  1        4      2     4.673113    -0.67311     0.453081
  2       10      4     8.304675     1.695325    2.874127
  3       18      8    16.95411      1.045836    1.093877
  4       11      6    12.44479     -1.44479     2.087431
  5       22     10    21.7595       0.2405      0.05784
  6        7      3     6.411388     0.588612    0.346464
  7       10      5    10.32227     -0.32227     0.103857
  8       14      7    14.65875     -0.65875     0.433953
  9       19      9    19.32311     -0.32311     0.104399
 10        3      1     3.148362    -0.14836     0.022011
                                        Sum =    7.57704
Solution: Regression Function 
~ Cej 
r 
30 
20 
10 
o 


123456 7 8 9 10111213 


2. Find the four linear functions that best fit the following four sets of data. Then 
plot the data. What does this tell you about fitting functions to data? 


Anscombe's quartet

      I              II             III            IV
  x      y       x      y       x      y       x      y
10.0   8.04    10.0   9.14    10.0   7.46     8.0   6.58
 8.0   6.95     8.0   8.14     8.0   6.77     8.0   5.76
13.0   7.58    13.0   8.74    13.0  12.74     8.0   7.71
 9.0   8.81     9.0   8.77     9.0   7.11     8.0   8.84
11.0   8.33    11.0   9.26    11.0   7.81     8.0   8.47
14.0   9.96    14.0   8.10    14.0   8.84     8.0   7.04
 6.0   7.24     6.0   6.13     6.0   6.08     8.0   5.25
 4.0   4.26     4.0   3.10     4.0   5.39    19.0  12.50
12.0  10.84    12.0   9.13    12.0   8.15     8.0   5.56
 7.0   4.82     7.0   7.26     7.0   6.42     8.0   7.91
 5.0   5.68     5.0   4.74     5.0   5.73     8.0   6.89
Solution: 


For each set of data, the mean of x and variance of x are the same. The same applies 
to the mean and variance of all the y values. The linear regression line is the same 
for all data sets. The other data presented in the table below is just for information. 


Property                                                   Value               Accuracy
Mean of x                                                  9                   exact
Sample variance of x: σ²                                   11                  exact
Mean of y                                                  7.50                to 2 decimal places
Sample variance of y: σ²                                   4.125               ±0.003
Correlation between x and y                                0.816               to 3 decimal places
Linear regression line                                     y = 3.00 + 0.500x   to 2 and 3 decimal places, respectively
Coefficient of determination of the linear regression: R²  0.67                to 2 decimal places


Plots of each set follow. What this shows is that it is important to visualize the data,
because a good regression fit alone can be misleading.
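A short check of the claim that all four data sets share nearly the same regression line is sketched below. It uses the standard published Anscombe data (eleven points per set) and closed-form least squares; the function and variable names are illustrative only.

```python
# Standard published Anscombe data: sets I-III share the same x values.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def linear_fit(xs, ys):
    """Return (intercept, slope) of the least-squares line y = intercept + slope*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((xi - mean_x) ** 2 for xi in xs)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


fits = {name: linear_fit(xs, ys) for name, (xs, ys) in quartet.items()}
for name, (intercept, slope) in fits.items():
    print(f"Set {name}: y = {intercept:.2f} + {slope:.3f}x")
```

The fitted lines agree to about two decimal places even though scatter plots of the four sets look very different, which is the point of the exercise.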
7. Discrete Optimization Modeling


1. Consider the problem of allocating resources to three users. The allocations
are X, Y, and Z. User 1 total revenue is 6X - X². User 2 total revenue is 7Y -
1.5Y². User 3 total revenue is 8Z - 0.5Z². The goal is to maximize (6X - X²) +
(7Y - 1.5Y²) + (8Z - 0.5Z²) given 6 units of resources available.


Show how to solve this allocation problem using discrete dynamic programming 
with integer allocations. Show how the dynamic programming network would be 
modified to be able to consider 8 integer resources as well as 6 resources to allocate 
to the three users having the same net benefit (total return) functions. What would 
the integer allocations and total returns be given 8 available resources? Show how 
this can be solved using the forward moving and backward moving approaches. 

To show that DP was used, show all F(S) values for each node S, and best 
decision (arrow or heavy line) if more than one possible decision. 


Solution: Backward method
For 8 resources: X = 2, Y = 1, Z = 5 for a total of $41.
For 6 resources: X = 1, Y = 1, Z = 4 for a total of $34.5.
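The backward-moving recursion behind these numbers can be sketched as follows. This is an illustrative implementation, not the network drawn in the book: the stage order (X, then Y, then Z), the table names F and best, and the choice to allow unused resources are assumptions of the sketch.

```python
def revenue(user, q):
    """Total revenue functions from the exercise (users 0, 1, 2 are X, Y, Z)."""
    if user == 0:
        return 6 * q - q ** 2
    if user == 1:
        return 7 * q - 1.5 * q ** 2
    return 8 * q - 0.5 * q ** 2


def solve_backward(total):
    n_users = 3
    # F[k][s]: best revenue from users k..2 when s units remain; F[3][s] = 0.
    F = [[0.0] * (total + 1) for _ in range(n_users + 1)]
    best = [[0] * (total + 1) for _ in range(n_users)]
    for k in range(n_users - 1, -1, -1):          # move backward through stages
        for s in range(total + 1):
            options = [(revenue(k, q) + F[k + 1][s - q], q) for q in range(s + 1)]
            F[k][s], best[k][s] = max(options)
    # Trace the stored best decisions forward from the initial state.
    s, allocation = total, []
    for k in range(n_users):
        q = best[k][s]
        allocation.append(q)
        s -= q
    return F[0][total], allocation


print(solve_backward(6))
print(solve_backward(8))
```

Because the table F is filled for every state up to the total, solving for 8 resources also yields the values for 6 resources along the way.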


2. (a) Using dynamic programming (i.e., a DP network) solve the following capac- 
ity expansion problem for the next 20 years (4 5-year construction periods) 
using forward and backward moving approaches. 


The following table provides estimates for the costs of additional water treatment 
plant capacity needed at the end of each 5-year period for the next 20 years. Find 
the capacity expansion schedule that minimizes the present values of the total 
future costs. If there is more than one least-cost solution, indicate which one you 
think is better, and why. 


Period   Years    Discounted cost of additional capacity     Total required capacity
                  Units of additional capacity                at end of period
                  2     4     6     8     10

1        1-5      12    15    18    23    26                  2

2        6-10     8     11    13    15                        6

3        11-15    6     8                                     8

4        16-20    4                                           10


Note: The discrete options in the first 5-year period are to add 2, 4, 6, 8 or 10 
units of capacity. In period 2 one can add any discrete even amount of capacity 
up to a total capacity of 10 units so if the beginning period capacity is 2 at least 
4 and at most 8 units can be added. And so on to the last period which must have 
an initial capacity of at least 8, and if so only two units can be added to reach 10 
units total. 


(b) The cost in each period t must be paid at the beginning of the period. What was 
the discount factor used to convert the costs at the beginning of each period t 
(say C(t)) to present value (or discounted) costs shown above? In other words, 
how would a cost at the beginning of period t be discounted to the beginning 
of period 1, given an annual interest rate of r? (Only the algebraic expression 
of the discount factor is asked for, not the numerical value of r.) 

(c) How would you deal with the uncertainty of future demands and costs? In 
other words, how would you use a model like the one you developed? 

(a) Forward method

[Figure: dynamic programming network of capacity states in each period.]

Solution 1: 10, 0, 0, 0.
Solution 2: 6, 0, 4, 0.

Both cost 26.

Best decision? It depends on other criteria.


(b) The discounted cost is:


(Cost at beginning of 5-year period t)/(1 + annual interest rate)^(5(t - 1))

so the discount factor is $1/(1 + r)^{5(t-1)}$.


(c) Of interest is the first decision. Will extending the planning period to 25 years
or altering the demand function change the first decision? If not, no problem.
Solve the problem over again in 5 years with updated data (guesses). In other words,
use the model sequentially every 5 years (or when needed), always seeing how
sensitive the current decision is to all the future assumptions.
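A compact backward recursion for part (a) is sketched below. It assumes required end-of-period capacities of 2, 6, 8, and 10 units, as implied by the note on feasible additions, and it keeps every least-cost schedule rather than only one. The names COST, REQUIRED, and best_schedules are illustrative.

```python
# Discounted cost of adding capacity in each period (0 units costs nothing).
COST = [
    {0: 0, 2: 12, 4: 15, 6: 18, 8: 23, 10: 26},
    {0: 0, 2: 8, 4: 11, 6: 13, 8: 15},
    {0: 0, 2: 6, 4: 8},
    {0: 0, 2: 4},
]
# Assumed required capacity at the end of each period.
REQUIRED = [2, 6, 8, 10]
MAX_CAP = 10


def best_schedules():
    # tail[cap] = (least cost of the remaining periods, list of optimal additions)
    tail = {cap: (0, [()]) for cap in range(0, MAX_CAP + 1, 2)}
    for t in range(len(COST) - 1, -1, -1):
        current = {}
        for cap in range(0, MAX_CAP + 1, 2):
            options = []
            for add, cost in COST[t].items():
                new_cap = cap + add
                if new_cap < REQUIRED[t] or new_cap not in tail:
                    continue  # requirement not met, or no feasible continuation
                future, plans = tail[new_cap]
                options.append((cost + future, add, plans))
            if options:
                least = min(total for total, _, _ in options)
                current[cap] = (least, [(add,) + plan
                                        for total, add, plans in options
                                        if total == least
                                        for plan in plans])
        tail = current
    return tail[0]


least_cost, schedules = best_schedules()
print(least_cost, schedules)
```

Rerunning the recursion with altered costs or requirements is one way to see how sensitive the first-period decision is to assumptions about later periods.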


3. Water Quality Management Model: 


Find the wastewater treatment efficiencies at sites 1 and 2 that meet stream quality 
standards at sites 2 and 3 at a total minimum cost. Currently there is no treatment. 
All the wastewater is discharged into the stream. 


Wastewater discharged: 200 kg/day at site 1 and 100 kg/day at site 2.

                                       Site 2     Site 3
Current pollutant concentrations:      58 mg/l    95 mg/l
Maximum allowable concentrations:      18 mg/l    23 mg/l


Available data:

Stream flow = 1000 m³/day at all sites. 1 kg/day/1000 m³/day = 1 mg/l.
Fraction of waste discharged into stream at site 1 that reaches site 2: 0.25.
Fraction of waste discharged at site 1 that reaches site 3: 0.15.
Fraction of waste at and discharged into stream at site 2 that reaches site 3: 0.60.
Limits of treatment: removal of 30% required, but no more than 90%, for both
sites. The initial concentration just upstream of site 1 is 32 mg/l.
The marginal cost of treatment at site 1 is 30 over the range of possible
treatment fractions.
The marginal cost of treatment at site 2 is 20 over the range of possible
treatment fractions.

Can you find the least-cost solution that meets the quality standards using 
dynamic programming? 


Dynamic programming formulation:
States are the existing discrete pollutant concentrations at site i.
Stages are the sites i.
Decisions are the waste removal fractions x_i at sites i = 1 and 2.


Transition function for pollutant concentrations P_j at sites j (quality):


$$P_j = [P_i + (W_i/Q)(1 - x_i)]\,a_{ij}, \quad a_{12} = 0.25,\ a_{23} = 0.60$$

where W_i is the waste load discharged at site i and Q is the stream flow, subject to

$$P_j \le P_j^{\max}$$

[Figure: dynamic programming network linking the concentration states at sites 1, 2, and 3 for alternative removal fractions x1 and x2.]

While this is a dynamic programming network one does not need to apply 
dynamic programming to see the least-cost path from 32 mg/l at site 1 to 23 mg/l 
at site 3 while not exceeding 18 mg/l at site 2. That path involves 80% treatment 
at both sites 1 and 2. Pollutant levels less than 23 at site 3 were not considered 
since that would add to the cost that is to be minimized. 
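The same result can be checked with a small two-stage search that applies the transition function directly. In this sketch the removal fractions are restricted to whole percentages between 30% and 90%, which is an assumed discretization, and the waste loads of 200 and 100 kg/day at sites 1 and 2 are those consistent with the stated current concentrations.

```python
UPSTREAM = 32.0                  # mg/l just upstream of site 1
LOAD = {1: 200.0, 2: 100.0}      # kg/day; at 1000 m3/day, 1 kg/day adds 1 mg/l
TRANSFER = {1: 0.25, 2: 0.60}    # a12 and a23
LIMIT = {2: 18.0, 3: 23.0}       # maximum allowable mg/l at sites 2 and 3
UNIT_COST = {1: 30.0, 2: 20.0}   # marginal treatment costs
PERCENTS = range(30, 91)         # assumed grid: 30% to 90% removal in 1% steps
EPS = 1e-9


def downstream(conc, site, pct):
    """Transition function: concentration at the next site downstream."""
    return (conc + LOAD[site] * (100 - pct) / 100) * TRANSFER[site]


def solve():
    best = None
    for p1 in PERCENTS:                          # stage 1 decision
        c2 = downstream(UPSTREAM, 1, p1)         # resulting state at site 2
        if c2 > LIMIT[2] + EPS:
            continue
        # Stage 2: the cheapest removal at site 2 that meets the site 3 limit.
        feasible = [p2 for p2 in PERCENTS
                    if downstream(c2, 2, p2) <= LIMIT[3] + EPS]
        if not feasible:
            continue
        p2 = min(feasible)
        total = UNIT_COST[1] * p1 / 100 + UNIT_COST[2] * p2 / 100
        if best is None or total < best[0] - EPS:
            best = (total, p1, p2)
    return best


print(solve())  # (total cost, percent removal at site 1, percent removal at site 2)
```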


4. Blueberries


There are three farmers' markets that sell organically and locally grown blueberries.
The farmer who grows these blueberries gets 90 percent of the income from
their sales; the markets get the other 10 percent. The demand for blueberries differs at
each market. Some smart economist has determined that the demand (unit price)
functions for blueberries at the three markets (m = 1, 2, 3) are 6/(1 + Q1), 7/(1 +
1.5Q2), and 8/(1 + 0.5Q3), respectively.


[Figure: Demand functions for blueberries, showing unit price versus quantity Qm.]


At each market m the unit price varies each week depending on the amount 
of blueberries available, Qm, to be sold. How should the farmer distribute a crop 
ranging from 1 to 6 bushels of blueberries each week to maximize the total amount 
of income received from all three markets? 

Solve for the maximum revenue obtainable from a total of 6 bushels using 
discrete dynamic programming, assuming integer allocations. Use both backward 
and forward approaches. Show your work on a network, not just the solution. 

Assuming beginning with a total of 6 and a maximum allocation to each market 
of 4 and a minimum allocation of 1, and ending with nothing left over: 


Solution:
Q1 = 1; Q2 = 1; Q3 = 4.


The links are the possible allocations, Q. The number on each link is the total rev- 
enue, TR, obtained for the particular allocation. The black numbers in the nodes are 
the remaining bushels of blueberries to be allocated. The red numbers are the max- 
imum revenue obtainable from remaining allocation decisions. The green numbers 
are the maximum revenue that could be obtained from previous allocation decisions. 
The red and green numbers depend on the existing remaining bushels, the black num- 
bers. The red links are the best decisions going forward, obtained from the backward 
moving approach, and the green arrows are the best decisions to have been made 
getting to the node or state, found by using the forward moving approach. 
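The forward-moving approach can be sketched in a few lines. Here the state is the number of bushels already allocated, total revenue at a market is unit price times quantity, and the 90 percent share going to the farmer is left out because it scales every option equally. The names PRICE_PARAMS and solve_forward are illustrative.

```python
PRICE_PARAMS = [(6, 1.0), (7, 1.5), (8, 0.5)]   # unit price = a / (1 + b*Q)
Q_MIN, Q_MAX, TOTAL = 1, 4, 6


def total_revenue(market, q):
    a, b = PRICE_PARAMS[market]
    return a * q / (1 + b * q)


def solve_forward():
    # G[used] = (best revenue so far, allocations) after the markets seen so far.
    G = {0: (0.0, [])}
    for market in range(len(PRICE_PARAMS)):
        new_G = {}
        for used, (rev, alloc) in G.items():
            for q in range(Q_MIN, Q_MAX + 1):
                state = used + q
                if state > TOTAL:
                    continue
                candidate = (rev + total_revenue(market, q), alloc + [q])
                if state not in new_G or candidate[0] > new_G[state][0]:
                    new_G[state] = candidate
        G = new_G
    return G[TOTAL]       # all 6 bushels allocated, nothing left over


best_revenue, allocation = solve_forward()
print(allocation, round(best_revenue, 3))
```

Changing TOTAL to any value from 3 to 6 gives the allocation for a smaller crop under the same minimum of 1 bushel per market.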
8. Linear Optimization Modeling
1. Bake Sale:


For a community fundraising event cakes and pies are to be sold. Find how many
cakes and pies should be baked to maximize total income.

Let A be the number of cakes and B the number of pies produced. The
following data apply:


Product:                                 A (cakes)   B (pies)
Income per item                          $6          $8
Pans required per item                   1           1
Labor required per item (person hours)   2           4


There are 80 pans and 280 person hours available, and because of limited cake 
ingredients, no more than 50 cakes (A) can be produced. 


The model can be written
Max total income = 6A + 8B
Subject to:
Pan constraint: A + B ≤ 80
Labor constraint: 2A + 4B ≤ 280
Ingredient constraint: A ≤ 50
Model solution: A = 20, B = 60, total income = $600.


[Figure: feasible region in the A-B plane, with the solution A = 20, B = 60, income $600.]
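The graphical solution can be mimicked numerically by checking every corner of the feasible region, since a linear objective reaches its maximum at a corner. The sketch below intersects each pair of constraint lines and keeps the feasible intersections; it is a two-variable illustration, not a general linear programming solver.

```python
from itertools import combinations

# Each constraint is (coef_A, coef_B, rhs), meaning coef_A*A + coef_B*B <= rhs.
constraints = [
    (1, 1, 80),     # pans
    (2, 4, 280),    # labor hours
    (1, 0, 50),     # cake ingredients
    (-1, 0, 0),     # A >= 0
    (0, -1, 0),     # B >= 0
]


def income(a, b):
    return 6 * a + 8 * b


def solve():
    best = None
    for (a1, b1, r1), (a2, b2, r2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if det == 0:
            continue                      # parallel lines never intersect
        a = (r1 * b2 - r2 * b1) / det     # Cramer's rule for the corner point
        b = (a1 * r2 - a2 * r1) / det
        feasible = all(ca * a + cb * b <= rhs + 1e-9 for ca, cb, rhs in constraints)
        if feasible and (best is None or income(a, b) > best[0]):
            best = (income(a, b), a, b)
    return best


print(solve())  # (income, cakes A, pies B)
```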


2. Diet model 


You manage the local SPCA (Society for the Prevention of Cruelty to Animals). 
Your dogs need to eat and there are two varieties of dog food available: foods 
D and C. Their unit costs are $1.10 and $0.90 per pound, respectively. Your job is to find
the least-cost combination of pounds of D and C for each dog that meets various
nutrition constraints shown in the table below. The ingredient contents are expressed
per pound of D and C.


Ingredient      D           C            Daily minimum per dog

Protein         3 ounces    4 ounces     8 ounces
Carbohydrate    5 ounces    12 ounces    11 ounces
Iron            30 mg       35 mg        100 mg


(a) First describe your objective function and constraints in words. 


Minimize cost of dog food while providing requirements for protein, carbohydrate, 
and iron. 


(b) Define the parameters and variables, and their units, that you can use to create 
a mathematical model. 


Parameters: Ounces of protein per pound of D and C.
Ounces of carbohydrate per pound of D and C.
Mg of iron per pound of D and C.
Cost per pound of D and C.
Minimum daily requirements of ounces of protein and carbohydrate.
Minimum daily requirements of mg of iron.


Variables: Pounds of D and C to buy.
(c) Express the model mathematically.


Model: Minimize 1.10 D + 0.90 C
subject to:
3D + 4C ≥ 8.
5D + 12C ≥ 11.
30D + 35C ≥ 100.
D ≥ 0, C ≥ 0.


(d) Show the solution by plotting the constraints and objective function on a graph 
of D versus C. 
Solution: $2.57, with D = 0 and C = 100/35 = 2.86.

[Figure: feasible region on a graph of D versus C, with the least-cost point at D = 0, C = 100/35.]


3. Labor Scheduling: 


A social welfare program involves three projects. Projects A, B, and C require 18, 
12 and 30 person months to complete. Four qualified social workers are available 
to work on these projects. 

Their monthly salaries are $3000, $3500, $3200, and $3900 respectively. 

All projects must be completed in 18 months, and each social worker can be
assigned to only one project in each 6-month period. Multiple workers can be
assigned to the same project.

Find the allocation of each worker to each project that minimizes the total cost of
completing the projects.


Solution:
Consider 6-month work periods t.
Variables: Xijt = 1 if worker Wi is assigned to project j during
period t, 0 otherwise.
Si = salary of Wi in a six-month period = 6 times the monthly salary.
Pj = labor requirements of project j in person-periods (6-month periods):
P1 = 3, P2 = 2, and P3 = 5.
C = total cost.
Model:
Minimize C
Subject to:

$$\sum_j X_{ijt} \le 1 \quad \forall i, t$$
(limits each worker to only one job in each period t)

$$C = \sum_i \sum_j \sum_t S_i X_{ijt}$$
where S1 = $3000*6, S2 = $3500*6, S3 = $3200*6, S4 = $3900*6

$$P_j = \sum_i \sum_t X_{ijt} \quad \forall j$$
where P1 = 3, P2 = 2, and P3 = 5

All Xijt are binary (0, 1) variables.
Solution: Cost = $198,000.


t:    1  2  3
W1:   A  C  B
W2:   A  C  C
W3:   A  C  B
W4:   C

4. A transportation problem 


Assume there are 4 warehouses containing personal protective equipment, commonly
referred to as "PPE," supplies that are used at 6 hospitals. Given the supplies
available at each warehouse, the demand at each hospital, and the unit costs of
transporting them (all known values), construct a model to determine how much
to transport from each warehouse to each hospital so as to minimize the total
transportation costs.

To do this you need to make up your notation for all variables and parameters. 
Plug in values of the parameters of the model and solve it to find how much is 
shipped from each warehouse to each hospital. 

What condition must be satisfied for your model to be feasible? 


Solution: Let X(i, j) be the amount shipped
from supply warehouse i to hospital j,
W(i) be the supply of PPE available at warehouse i,
H(j) be the demand for PPE at hospital j, and
C(i, j) be the cost per unit of PPE shipped from i to j.


$$\text{Minimize } \sum_i \sum_j C(i,j)\,X(i,j)$$

subject to

$$\sum_i X(i,j) \ge H(j), \quad j = 1, \dots, 6$$

$$\sum_j X(i,j) \le W(i), \quad i = 1, \dots, 4$$


To be feasible the total supply at all warehouses must equal or exceed the total 
demand at all hospitals. 


5. Forest management 


A particular State Forest has four different subareas whose characteristics such as 
species composition, age distribution, drainage, soil characteristics, etc. are simi- 
lar. The areas of these subareas are known. Recent growth studies have produced 
predictions of the volumes per hectare for each subarea for the next 50 years. The 
forest manager is responsible for defining a cutting schedule that will produce a 
steady supply of logs to be cut into lumber over the 50-year life span of the forest. 
Her goal is to maximize a constant amount of wood (volume) that can be converted
to lumber every year.

Develop a model for determining just how much volume can be cut in each
subarea in each of five 10-year periods. Once trees in any area are cut, that area
cannot be cut again for another 50 years. Cutting trees from the forest in this
sustainable way increases water yields, the quality of wildlife habitat, and timber
income.

Define the variables, parameters, and constraints you need, and use them to 
build and solve a model for identifying the best cutting schedule—i.e., how much 
to cut, where, and when. 


Let Volume be the unknown maximum constant volume of wood
cut from the forest in each period.
H(j, t) = the unknown number of hectares to be cut in
subarea j in period t.
V(j, t) = the known estimated average volume
per hectare in subarea j in period t.
A(j) = the known total number of hectares of land in subarea j.

Maximize Volume


Subject to:

$$\sum_j H(j,t)\,V(j,t) \ge Volume, \quad t = 1, \dots, 5$$

$$\sum_t H(j,t) \le A(j), \quad j = 1, \dots, 4$$

6. Water Quality Management Model 
Find the wastewater treatment efficiencies at sites 1 and 2 that meet stream quality
standards at sites 2 and 3 at a minimum total cost. Currently there is no treatment.
All the wastewater is discharged into the stream.


Wastewater discharged: 200 kg/day at site 1 and 100 kg/day at site 2.

                                       Site 2     Site 3
Current pollutant concentrations:      58 mg/l    95 mg/l
Maximum allowable concentrations:      18 mg/l    23 mg/l


Available data:

Stream flow = 1000 m³/day at all sites. 1 kg/day/1000 m³/day = 1 mg/l.

Fraction of waste discharged into stream at site 1 that reaches site 2: 0.25.

Fraction of waste discharged at site 1 that reaches site 3: 0.15.

Fraction of waste at and discharged into stream at site 2 that reaches site 3:
0.60.

Limits of treatment: removal of 30% required, but no more than 90%, for both
sites. The initial concentration just upstream of site 1 is 32 mg/l.

Can you find the least-cost solution that meets the quality standards without 
knowing the cost functions for treatment? 


Solution:
Model: Assume marginal costs, c1 and c2, are constant
between 0.3 and 0.9 removal
fractions and that, because of greater waste loads at site 1 than at
site 2, c1 > c2.
Minimize c1*x1 + c2*x2.
Quality at site 2:
(32 + 200*(1 - x1))*0.25 ≤ 18.
Quality at site 3:
(32 + 200*(1 - x1))*0.15 + 100*(1 - x2)*0.60 ≤ 23.
Treatment restrictions:
x1 ≤ 0.9; x2 ≤ 0.9;
x1 ≥ 0.3; x2 ≥ 0.3;
Marginal costs:
c1 = 30 assumed.
c2 = 20 assumed.
Model solution:
Objective value: 40.00000
Variable   Value
x1         0.8000000
x2         0.8000000

Equivalent model:
Minimize c1 x1 + c2 x2
Subject to:
x1 ≥ 0.8
5x1 + 10x2 ≥ 12.0
0.3 ≤ x1 ≤ 0.9
0.3 ≤ x2 ≤ 0.9

[Figure: feasible region of the equivalent model in the x1-x2 plane.]


Note: From the diagram, one can see that if c1 > 1/2 of c2, the solution will stay at the above
solution. One may not need cost data to find the least-cost solution. (But financing
folks will need to know this.) Message: Use models to determine what data are
needed and how accurate those data must be to identify optimal solutions.
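The note about cost ratios can be illustrated by comparing the two corner points of the equivalent model that can be least-cost when both marginal costs are positive: (x1, x2) = (0.8, 0.8) and (0.9, 0.75), the latter being where 5x1 + 10x2 = 12 meets x1 = 0.9. The cost values tried below, other than 30 and 20, are arbitrary illustrations.

```python
# Candidate least-cost corners of the equivalent model's feasible region.
vertices = [(0.8, 0.8), (0.9, 0.75)]


def least_cost_vertex(c1, c2):
    """Return the corner with the lower total cost c1*x1 + c2*x2."""
    return min(vertices, key=lambda v: c1 * v[0] + c2 * v[1])


for c1, c2 in [(30, 20), (15, 20), (5, 20)]:
    x1, x2 = least_cost_vertex(c1, c2)
    print(f"c1 = {c1}, c2 = {c2}: x1 = {x1}, x2 = {x2}")
```

The solution moves away from (0.8, 0.8) only when c1 falls below half of c2, so a rough estimate of the cost ratio is all that is needed to identify the least-cost treatment levels.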


9. Some Linearization Methods 
1. Groundwater pumping: 


This is an exercise in the use of fixed costs and piecewise linear variable costs. 
Show how to consider the following cost functions for supplies S. 


1. Fixed = 0, variable = 10.
2. Fixed = 0, variable = 5.
3. Fixed = 0, variable = 8 to S = 5, then 15.
4. Fixed = 20, variable = 5.
5. Fixed = 14, variable = 4 to S = 6, then 12.
6. Fixed = 20, variable = 5 to S = 7, then 3.


[Figure: cost versus supply S for the six cost functions, with fixed costs of 14 and 20 and slope changes at S = 5, 6, and 7.]


Develop a model to find the minimum cost to meet a demand from two sources 
of groundwater. Assume: 


Qa = flow from source A, unknown (m³/day)

Qb = flow from source B, unknown (m³/day)

Ca(Qa) = cost function ($)

Cb(Qb) = cost function ($)

Demand = required to be met (m³/day)

Ka, Kb = maximum flow capacity of well fields A and B, respectively (m³/day)

To find cost-effective ways of meeting demand:


Minimize: Cost = Ca(Qa) + Cb(Qb)
Subject to:
Qa + Qb ≥ Demand
Qa ≤ Ka
Qb ≤ Kb

Plus the equations and variables needed to convert cost functions
Ca(Qa) and Cb(Qb) to a linear form, including the use of 0, 1 variables.
For either a or b, with K the maximum flow capacity:
For cost function 1: C(Q) = 10 Q.
For cost function 2: C(Q) = 5 Q.
For cost function 3: C(Q) = 8(q1) + 15(q2); q1 + q2 = Q; q1 ≤ 5.
For cost function 4: C(Q) = 20z + 5Q; Q ≤ K z; z = (0, 1).
For cost function 5: C(Q) = 14z + 4(q1) + 12(q2); q1 ≤ 6; q1 + q2 = Q;
Q ≤ K z; z = (0, 1).
For cost function 6: C(Q) = 20(z1) + 5(q1) + (20 + 7 × 5)(z2) + 3(q2);
q1 ≤ 7(z1); q2 ≤ K(z2); q1 + 7(z2) + q2 = Q; z1 + z2 ≤ 1; z1 = (0, 1); z2 = (0, 1).
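One way to gain confidence in a 0-1 formulation is to compare it with the cost function evaluated directly. The sketch below does this for cost function 6, the concave case, by enumerating the allowed combinations of z1 and z2. It assumes an upper bound q2 ≤ K z2 on the second segment and an illustrative capacity K = 20; both are assumptions of the sketch.

```python
K = 20  # assumed well-field capacity, for illustration only


def direct_cost6(q):
    """Cost function 6: fixed cost 20, slope 5 up to S = 7, slope 3 beyond."""
    if q == 0:
        return 0
    return 20 + 5 * min(q, 7) + 3 * max(q - 7, 0)


def formulation_cost6(q):
    """Least cost the 0-1 formulation allows for a total flow q."""
    best = None
    for z1, z2 in [(0, 0), (1, 0), (0, 1)]:        # z1 + z2 <= 1
        remaining = q - 7 * z2                     # from q1 + 7*z2 + q2 = q
        if remaining < 0:
            continue
        q2 = min(remaining, K * z2)                # q2 <= K*z2 (assumed bound)
        q1 = remaining - q2
        if q1 > 7 * z1:                            # q1 <= 7*z1
            continue
        cost = 20 * z1 + 5 * q1 + (20 + 7 * 5) * z2 + 3 * q2
        if best is None or cost < best:
            best = cost
    return best


for q in (0, 5, 7, 10, 20):
    print(q, direct_cost6(q), formulation_cost6(q))
```

Without the bound on q2 the formulation could supply any flow at a unit cost of 3 with no fixed charge, which is why the sketch includes it.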
Now consider increasing demands for flow over time. Develop a model that finds
the minimum cost pumping schedule over time. Just assume Ca() and Cb() as the
cost functions for adding additional flow capacity in any period t.


Let:

Qa(t), Qb(t) = the total flow from each wellfield at end of period t,
ΔQa(t) and ΔQb(t) = the additional flow added in period t,

Ca(ΔQa(t)) and Cb(ΔQb(t)) = the cost of the additional flow,

Demand(t) = the demand at end of period t.

Given the estimated demands Demand(t) over time t, the capacity expansion
problem is to:


$$\text{Minimize } \sum_t (1 + r)^{-t}\,[C_a(\Delta Q_a(t)) + C_b(\Delta Q_b(t))]$$


Subject to:
Qa(t) + Qb(t) ≥ Demand(t)
Qa(t - 1) + ΔQa(t) = Qa(t)
Qb(t - 1) + ΔQb(t) = Qb(t)
Qa(t) ≥ Qa(t - 1); Qa(t) ≤ Ka
Qb(t) ≥ Qb(t - 1); Qb(t) ≤ Kb
for all t.


2. Capacity expansion problem 


To meet a growing demand for public housing, a community has decided to build 
more housing units. There are two sites where this can be done, and the question 
is which site is less expensive over time. Assume these sites are named A and B. 
Let A(t) and B(t) be the capacity of each of those sites at the beginning of period 
t. Let KA(t) and KB(t) be the added capacity in period t, costing Ca(KA(t)) and 
Cb(KB(t)). Construction periods last 5 years; hence each period t will be a 5-year 
period. Costs must be paid at the beginning of each period. 
Cost functions: 


Ca(KA(t)) = 15 + 8 KA(t) if KA(t) > 0; otherwise = 0.
Cb(KB(t)) = 5 + 9 KB(t) if KB(t) > 0; otherwise = 0.
Assume these apply in each period t.
r = annual interest rate. Discount factor: 1/((1 + r)^(5*(t - 1)))
Projections of future demands for public housing have been made. Estimates of total
capacity requirement are:
End of period 1: 5
End of period 2: 10
End of period 3: 18
End of period 4: 33


Solve using linear programming, and show the sensitivity of the solution to the 
value of the annual interest rate r. 

Variables At and Bt are the additions to the housing capacities in period t. Ct is 
the total housing capacity at the end of period t. CAt and CBt are the costs incurred 
in period t. Dt is the discount factor for period t. 


Minimize PWC;
D1 = 1; D2 = 1/((1 + r)^5); D3 = 1/((1 + r)^10); D4 = 1/((1 + r)^15);
PWC = D1 * (CA1 + CB1) + D2 * (CA2 + CB2)
+ D3 * (CA3 + CB3) + D4 * (CA4 + CB4);

CA1 = 15 * ZA1 + 8 * A1; CB1 = 5 * ZB1 + 9 * B1;
MaxCap * ZA1 ≥ A1; MaxCap * ZB1 ≥ B1;
CA2 = 15 * ZA2 + 8 * A2; CB2 = 5 * ZB2 + 9 * B2;
MaxCap * ZA2 ≥ A2; MaxCap * ZB2 ≥ B2;
CA3 = 15 * ZA3 + 8 * A3; CB3 = 5 * ZB3 + 9 * B3;
MaxCap * ZA3 ≥ A3; MaxCap * ZB3 ≥ B3;
CA4 = 15 * ZA4 + 8 * A4; CB4 = 5 * ZB4 + 9 * B4;
MaxCap * ZA4 ≥ A4; MaxCap * ZB4 ≥ B4;
All ZAi and ZBi variables are binary 0, 1 values.
Demands;
A1 + B1 = C1; C1 ≥ Dem1; C1 + A2 + B2 = C2; C2 ≥ Dem2;
C2 + A3 + B3 = C3; C3 ≥ Dem3; C3 + A4 + B4 = C4; C4 ≥ Dem4;
MaxCap = 50;

Set r to different values as shown below.


Demands at end of period t:

Dem1 = 5;
Dem2 = 10;
Dem3 = 18;
Dem4 = 33.


Results:

Variable   r = 0       r = 0.01     r = 0.02     r = 0.05     r = 0.10
PWC        279.00      275.15       258.09       201.38       143.05
D1         1.000000    1.000000     1.000000     1.000000     1.000000
D2         1.000000    0.9514657    0.9057308    0.7835262    0.6209213
D3         1.000000    0.9052870    0.8203483    0.6139133    0.3855433
D4         1.000000    0.8613495    0.7430147    0.4810171    0.2393920
CA1        279.00      -            -            -            -
CB1        -           95.00        50.00        50.00        50.00
CA2        -           -            119.00       -            -
CB2        -           -            -            50.00        50.00
CA3        -           199.00       -            -            -
CB3        -           -            -            77.00        77.00
CA4        -           -            135.00       135.00       135.00
A1         33          -            -            -            -
B1         -           10           5            5            5
A2         -           -            13           -            -
B2         -           -            -            5            5
A3         -           23           -            -            -
B3         -           -            -            8            8
A4         -           -            15           15           15
C1         33          10           5            5            5
C2         33          10           18           10           10
C3         33          33           18           18           18
C4         33          33           33           33           33

A dash marks a variable that does not appear in the listed solution (value zero).


Notice the impact of an increasing interest rate on the capacity expansion
schedule.
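The sensitivity to r can also be explored with a small forward recursion over total capacity. This is a sketch under stated assumptions rather than the mixed-integer model above: capacities are restricted to whole units up to 33, and each period's addition is built entirely at whichever site is cheaper for that amount, since splitting an addition would incur both fixed costs.

```python
DEMAND = [5, 10, 18, 33]        # required total capacity at end of periods 1 to 4
MAX_TOTAL = DEMAND[-1]


def addition_cost(k):
    """Cheapest cost of adding k units in one period, at site A or site B."""
    if k == 0:
        return 0
    return min(15 + 8 * k, 5 + 9 * k)


def min_present_cost(r):
    # F[cap] = least discounted cost of holding total capacity cap so far.
    F = {0: 0.0}
    for t, demand in enumerate(DEMAND, start=1):
        discount = 1 / (1 + r) ** (5 * (t - 1))   # costs paid at start of period t
        new_F = {}
        for cap, so_far in F.items():
            for new_cap in range(max(cap, demand), MAX_TOTAL + 1):
                total = so_far + discount * addition_cost(new_cap - cap)
                if new_cap not in new_F or total < new_F[new_cap]:
                    new_F[new_cap] = total
        F = new_F
    return min(F.values())


for r in (0.0, 0.01, 0.02, 0.05, 0.10):
    print(f"r = {r:.2f}: least present worth cost = {min_present_cost(r):.2f}")
```

The printed costs can be compared with the PWC values in the results above. Higher interest rates favor postponing construction, even at the price of paying fixed costs more than once.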


3. There are two users of resources, A and B, whose income depends on the
resources they receive. Let those allocations be A and B respectively. The
income to user A equals 10A - 0.5A². The income to user B is 5B - 0.25B².

(a) What are the allocations that result in the maximum total income?

(b) If you have only 14 resources to allocate, show how you could get an
approximate solution using linear programming.

(c) Show how the model would be modified to obtain the maximum equal
income for both users.

(a) Differentiating each income function and setting the slopes to 0
(10 - A = 0 and 5 - 0.5B = 0) results in A = 10 and B = 10.

(b) Consider the following allocation model.


Maximize (10A - 0.5A²) + (5B - 0.25B²) where A + B cannot exceed 14.
A solution: Divide A and B into two segments of 5 each. Calculate the linear
slopes and add the new variables and constraints as indicated below.


(10*(5) - 0.5*(5^2))/5 = sa1;
((10*(10) - 0.5*(10^2)) - (10*(5) - 0.5*(5^2)))/5 = sa2;
(5*(5) - 0.25*(5^2))/5 = sb1;
((5*(10) - 0.25*(10^2)) - (5*(5) - 0.25*(5^2)))/5 = sb2.
Linear model:


Max = sa1*a1 + sa2*a2 + sb1*b1 + sb2*b2.
A = a1 + a2; a1 ≤ 5; B = b1 + b2; b1 ≤ 5;
[Res] A + B ≤ 14.
Global optimal solution found.


Objective value: 66.25000

Variable   Value      Reduced Cost
SA1        7.500000   0.000000
A1         5.000000   0.000000
SA2        2.500000   0.000000
A2         4.000000   0.000000
SB1        3.750000   0.000000
B1         5.000000   0.000000
SB2        1.250000   0.000000
B2         0.000000   1.250000
A          9.000000   0.000000
B          5.000000   0.000000


Row   Slack or Surplus   Dual Price
RES   0.000000           2.500000

Or one could take mid slopes of the original nonlinear income functions (that range
from 0 to 10), say at 2.5 and 7.5, for determining the linear slopes of the approximate
income function in each segment. Finding slopes of functions is discussed in the next
chapter.


Maximize sa1*a1 + sa2*a2 + sb1*b1 + sb2*b2.
(10 - (2.5)) = sa1;
(10 - (7.5)) = sa2;
(5 - 0.5*(2.5)) = sb1;
(5 - 0.5*(7.5)) = sb2.
A = a1 + a2; a1 ≤ 5; B = b1 + b2; b1 ≤ 5; [Res] A + B ≤ 14.


The slopes are the same, as is the solution, but this entire model, including the
slope definitions, is linear. Compare this solution with that of Exercise 3 in Chap.
11.


(c) If the objective were to find the maximum equal income, the model and solution
are:


Maximize EqualIncome;
sa1*a1 + sa2*a2 ≥ EqualIncome;
sb1*b1 + sb2*b2 ≥ EqualIncome;
(10 - (2.5)) = sa1;
(10 - (7.5)) = sa2;
(5 - 0.5*(2.5)) = sb1;
(5 - 0.5*(7.5)) = sb2.
A = a1 + a2; a1 ≤ 5; B = b1 + b2; b1 ≤ 5; [Res] A + B ≤ 14.


Variable      Value      Reduced Cost
EqualIncome   25.71429   0.000000
sa1           7.500000   0.000000
a1            3.428571   0.000000
sa2           2.500000   0.000000
a2            0.000000   0.714286
sb1           3.750000   0.000000
b1            5.000000   0.000000
sb2           1.250000   0.000000
b2            5.571429   0.000000
A             3.428571   0.000000
B             10.57143   0.000000


Row   Slack or Surplus   Dual Price
RES   0.000000           1.071429
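Both results can be reproduced without a linear programming package, because the problem has a single shared resource and the segment slopes decrease for each user. The sketch below fills segments in order of decreasing slope for part (b) and uses bisection on the common income level for part (c). As in the listed model, the second segment of each user is left unbounded; the function names are illustrative.

```python
RESOURCES = 14
INF = float("inf")
# (user, slope, segment length); second segments are unbounded as in the model.
SEGMENTS = [("A", 7.5, 5), ("A", 2.5, INF), ("B", 3.75, 5), ("B", 1.25, INF)]


def max_total_income():
    """Part (b): allocate to the steepest remaining segment first."""
    remaining, income = RESOURCES, 0.0
    alloc = {"A": 0.0, "B": 0.0}
    for user, slope, length in sorted(SEGMENTS, key=lambda s: -s[1]):
        use = min(length, remaining)
        alloc[user] += use
        income += slope * use
        remaining -= use
    return income, alloc


def resources_needed(income, slope1, slope2, first_len=5):
    """Resources one user needs to earn a given piecewise-linear income."""
    first = slope1 * first_len
    if income <= first:
        return income / slope1
    return first_len + (income - first) / slope2


def max_equal_income():
    """Part (c): largest income both users can earn with 14 resources."""
    lo, hi = 0.0, 100.0
    for _ in range(60):
        mid = (lo + hi) / 2
        need = resources_needed(mid, 7.5, 2.5) + resources_needed(mid, 3.75, 1.25)
        if need <= RESOURCES:
            lo = mid
        else:
            hi = mid
    return lo


print(max_total_income())
print(round(max_equal_income(), 5))
```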


10. Solving Models Using Calculus
1. Warmup. 


The following examples show that if you want to compute the average value
of a function over a range of values, you should compute the average of different
function values rather than computing the function's value at the average input
value.


[Figure: the functions 10X - X² (concave), 5X (linear), and X² (convex).]

X                     10X - X²   5X      X²

0                     0          0       0
1                     9          5       1
2                     16         10      4
3                     21         15      9
4                     24         20      16
5                     25         25      25

Sum/6:       15/6     95/6       75/6    55/6
Arithmetic
mean, AM:    2.5      15 5/6     12.5    9 1/6

Consider each of these functions. Note that:

For concave functions:
Mean of function values < function value for mean x.
15 5/6 = 15.83 < 10(2.5) - 2.5² = 18.75

For convex functions:
Mean of function values > function value for mean x.
9 1/6 = 9.167 > 2.5² = 6.25

For linear functions:
Mean of function values = function value for mean x.
12.5 = (5)2.5 = 12.5

Show that the true mean is between these two values for each function.

One can integrate the function and divide by the interval over which it applies.
In this case one-fifth of the integral from 0 to 5 of (10x - x²) is (2/3)5² = 16 2/3,
and a fifth of the integral of x² from 0 to 5 is 8 1/3.

This shows that:

For concave functions:
Mean of function values < true mean < function value for mean x.
15 5/6 < 16 2/3 < 18.75

For convex functions:
Mean of function values > true mean > function value for mean x.
9 1/6 > 8 1/3 > 6.25

For linear functions:
Mean of function values = true mean = function value for mean x.
12.5 = 12.5 = 12.5


Calculating the mean based on the two end points of concave or convex functions 
is assuming they are linear. It underestimates the mean for concave functions, and 
overestimates the mean for convex functions, as shown in the above figure. 
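The three kinds of mean compared above can be computed exactly with rational arithmetic. In this sketch each function is paired with its antiderivative so that the true mean over the interval 0 to 5 needs no numerical integration; the dictionary layout and names are illustrative.

```python
from fractions import Fraction

# Each entry: (function, antiderivative), so the true mean is exact.
FUNCTIONS = {
    "concave 10x - x^2": (lambda x: 10 * x - x ** 2, lambda x: 5 * x ** 2 - x ** 3 / 3),
    "linear 5x": (lambda x: 5 * x, lambda x: 5 * x ** 2 / 2),
    "convex x^2": (lambda x: x ** 2, lambda x: x ** 3 / 3),
}

points = [Fraction(k) for k in range(6)]          # x = 0, 1, ..., 5
mean_x = sum(points) / len(points)                # 5/2

results = {}
for name, (f, antiderivative) in FUNCTIONS.items():
    mean_of_values = sum(f(x) for x in points) / len(points)
    true_mean = (antiderivative(Fraction(5)) - antiderivative(Fraction(0))) / 5
    value_at_mean = f(mean_x)
    results[name] = (mean_of_values, true_mean, value_at_mean)
    print(name, mean_of_values, true_mean, value_at_mean)
```

Each printed triple lists the mean of the six function values, the true mean from integration, and the function value at the mean x, in that order.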


1. Benefit Cost analysis. 


Assume a benefit function B = 60x^0.8 and a cost function C = 4 + 7x^1.5. The
maximum difference between B and C is the net benefit, NB.


(a) Find the value of x that results in the maximum net benefits.


$$\frac{dNB}{dx} = 0 = \frac{d\,[60x^{0.8} - (4 + 7x^{1.5})]}{dx}$$

$$0.8(60)/x^{0.2} = 1.5(7)\,x^{0.5} \quad \text{or} \quad x^{0.7} = 48/10.5$$

Thus, it occurs when x = 8.768622.

(b) Would an increase in the fixed cost of 4 affect the value of x? 


Solution: It could if it caused the cost function to be above
the benefit function, in which case x would = 0.
2. Water supply utility 


You are a mayor of a town that is considering privatizing the public water 
supply system. Currently the public water supply system is operating in such a 
way that maximizes the benefits to its consumers (willingness to pay) while still 
paying for the service. No profit is made. If it is privatized, the private company 
will want to maximize its profits (revenue less costs). 


For example, consider the functions shown below: 


The horizontal axis is the amount of water delivered, and the vertical axis is money 
representing the unit price of water charged, the total and marginal costs and the 
total and marginal revenue. 


Willingness to pay is the area under the demand curve.
Public utility objective: Maximize willingness to pay less cost of supplying water.
Private utility objective: Maximize total revenue less cost of supplying water.
Total revenue is unit price times the quantity Q sold.


[Figure: Public utility. Demand function Q = f(P), unit price P, unit cost C, total consumer benefit ("consumer surplus"), total cost, and quantity Qpub.]

[Figure: Private utility. Demand function Q = f(P), marginal revenue, consumer surplus (benefit), profit ("producer's surplus"), total cost, and quantities Qpri and Qpub.]


For an amount of water Q assume the total cost = 5Q and the demand function =
unit price = 12 - 1.5Q.

[Figure: total cost and unit price versus quantity Q.]

Given these data, find the best amounts of water to deliver and the associated
unit prices to charge for both a public and private utility. The public utility should
maximize consumer surplus (willingness to pay less its costs), and the private utility
will maximize its producer surplus or profit (subject to any regulations it must
meet; in this example there are none).


Find the solutions and graph the solutions like in the figures above. Identify on the 
graph the consumer’s surplus, producer’s surplus, and total cost. 


For a public utility what should the unit price be for the water supplied, and how 
does it compare to the marginal cost? 


For a private utility what should the unit price be for the water supplied, and how 
does it compare to the marginal cost? Hence what is the unit and total profit? 


Solution:
Private utility: Maximize Net Revenue = Total Revenue - Cost
Total revenue = price * quantity = (12 - 1.5Q)Q
Cost = 5Q
Maximize (12Q - 1.5Q² - 5Q)
d(12Q - 1.5Q² - 5Q)/dQ = 0 = [12 - 2(1.5)Q] - 5 = 7 - 3Q
so Qpri = 7/3 = 2.33
This occurs when the marginal revenue = marginal cost = 5. The unit
price = 12 - 1.5(7/3) = 8.5. Hence, the unit profit = 8.5 - 5 = 3.5.
Total profit = 3.5(7/3) = 8.167.

[Figure: Private utility solution, showing total cost = 5Q, unit price = 12 - 1.5Q, marginal (unit) cost = 5, and marginal revenue = 12 - 3Q versus quantity Q.]


Public Utility: Maximize Consumer Surplus:
Consumer surplus = 0.5(q*)(Po - (Po - bq*)) + q*(Po - bq*) - 5q*, with Po = 12 and b = 1.5
Maximize [0.5(12 - (12 - 1.5Qpub))Qpub + Qpub(12 - 1.5Qpub)] - 5Qpub
= -0.75Qpub^2 + 7Qpub
d(-0.75Qpub^2 + 7Qpub)/dQpub = 0 = -1.5Qpub + 7
Thus, Qpub = 7/1.5 = 4.67
Note: Qpub (4.67) equates unit price (12 - 1.5(4.67)) to marginal cost (5).

[Figure: Public utility solution. Total cost = 5Q and marginal revenue = 12 - 3Q versus quantity Q, with Qpri and Qpub marked.]
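
The two utility solutions can be checked numerically. The following is a minimal Python sketch, assuming the cost and demand functions stated in the exercise; the function names and the simple grid search are illustrative choices and are not part of the original solution.

```python
def unit_price(q):
    """Demand function from the exercise: unit price at quantity q."""
    return 12.0 - 1.5 * q

def total_cost(q):
    return 5.0 * q

def profit(q):
    """Private utility objective: total revenue less total cost."""
    return unit_price(q) * q - total_cost(q)

def net_benefit(q):
    """Public utility objective: area under the demand curve less total cost."""
    willingness_to_pay = 12.0 * q - 0.75 * q ** 2  # integral of (12 - 1.5q) dq
    return willingness_to_pay - total_cost(q)

def argmax_on_grid(f, lo=0.0, hi=8.0, steps=80000):
    """Return the grid point in [lo, hi] with the largest value of f."""
    return max((lo + (hi - lo) * i / steps for i in range(steps + 1)), key=f)

q_pri = argmax_on_grid(profit)
q_pub = argmax_on_grid(net_benefit)

print("private: Q =", round(q_pri, 3), "price =", round(unit_price(q_pri), 3),
      "profit =", round(profit(q_pri), 3))
print("public:  Q =", round(q_pub, 3), "price =", round(unit_price(q_pub), 3))
```

A grid search is used only because it needs no calculus or solver; setting the derivatives to zero, as in the solution above, gives the same quantities exactly.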


11. Lagrangian Models 


1. Benefit Cost analysis.
Assume a benefit function B = 60*x^0.8 and a cost function C = 4 + 7*x^1.5. The
maximum difference between B and C, the maximum net benefits, occurs at x =
8.7686.


(a) Would an increase in the fixed cost of 4 affect the value of x? 


Solution: It could if it caused the cost function to be above the benefit function, in 
which case x would = 0. 


(b) Use a Lagrangian model to find the value of the shadow price, or Lagrangian 
multiplier, if x cannot exceed 5. What does the multiplier signify? 


Solution:
L = 60*x^0.8 - (4 + 7*x^1.5) - λ(x - 5)


∂L/∂x: 0 = 0.8*60*x^(-0.2) - 1.5*7*x^(0.5) - λ


∂L/∂λ: 0 = x - 5, an equality since x wants to be more than 5.


When solved, x = 5 and λ = 11.31, the change in net benefits per unit change in the limit of 5. It
is the slope of the net benefit function at x = 5.
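
As a numerical check of the multiplier, the sketch below evaluates the slope of the net benefit function at the bound x = 5 and compares it with a finite-difference estimate. It assumes the benefit and cost functions given in the exercise; the variable names are illustrative.

```python
def net_benefit(x):
    """Net benefit B - C from the exercise."""
    return 60.0 * x ** 0.8 - (4.0 + 7.0 * x ** 1.5)

def slope(x):
    """Derivative of the net benefit with respect to x."""
    return 0.8 * 60.0 * x ** (-0.2) - 1.5 * 7.0 * x ** 0.5

# Unconstrained optimum: slope = 0 gives x^0.7 = 48 / 10.5
x_unconstrained = (48.0 / 10.5) ** (1.0 / 0.7)

# With the binding limit x <= 5, the multiplier equals the slope at x = 5
lam = slope(5.0)

# Central finite difference as an independent check of the slope
h = 1e-4
lam_fd = (net_benefit(5.0 + h) - net_benefit(5.0 - h)) / (2.0 * h)

print(round(x_unconstrained, 4), round(lam, 2), round(lam_fd, 2))
```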


2. Allocating resources 


Consider the problem of allocating resources to three users. The allocations are X, 
Y, and Z. User 1’s total revenue is 6X/(1 + X). User 2’s total revenue is 7Y/(1 + 
Y). User 3’s total revenue is 8Z/(1 + Z). Assume 10 resources are available. 

Show how to find the allocations that maximize the total revenue from all three 
users, and the associated shadow price of the resource constraint, using Lagrange 
multipliers. Compare that solution with one obtained from solving the model itself, 
say using Solver in Excel. 


Setting the slopes equal to each other and to the shadow price λ:
6/(1 + x)^2 = λ; 7/(1 + y)^2 = λ; 8/(1 + z)^2 = λ; and x + y + z = 10.


Results in:
Variable   Value
x          3.018770
λ          0.3715058
y          3.340768
z          3.640461

Compared to:


Maximize [6*x/(1 + x) + 7*y/(1 + y) + 8*z/(1 + z)] subject to x + y + z = 10.
Objective value: 16.17042

Variable   Value
x          3.018766
y          3.340763
z          3.640471

Row                   Dual Price λ
Resource constraint   0.3715060
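
Because each condition c/(1 + a)^2 = λ can be rearranged to 1 + a = sqrt(c/λ), the multiplier has a closed form once the three conditions are summed. The sketch below uses that rearrangement, which is an illustrative shortcut rather than the solver run reported above; it assumes all allocations turn out positive, which they do here.

```python
import math

coeffs = {"x": 6.0, "y": 7.0, "z": 8.0}   # revenue of each user is c*a/(1 + a)
total = 10.0                               # resources available

# Summing 1 + a = sqrt(c)/sqrt(lam) over users: total + n = sum(sqrt(c))/sqrt(lam)
sqrt_lam = sum(math.sqrt(c) for c in coeffs.values()) / (total + len(coeffs))
lam = sqrt_lam ** 2

alloc = {name: math.sqrt(c) / sqrt_lam - 1.0 for name, c in coeffs.items()}
revenue = sum(c * alloc[name] / (1.0 + alloc[name]) for name, c in coeffs.items())

print({name: round(a, 4) for name, a in alloc.items()}, round(lam, 4), round(revenue, 4))
```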


3. There are two users of resources, A and B, whose income depends on the
resources they are allocated. Let those allocations be A and B respectively. The
income to user A equals 10A - 0.5A^2. The income to user B is 5B - 0.25B^2. You
wish to know what allocations result in the maximum total income. You only
have 14 resources to allocate and are curious what marginal increase in total
income could result if you had a few more resources.

Solving a Lagrange model: Each user's income is maximized at an allocation of 10,
so without a limit the two users would want 20 resources in total. Thus the
constraint A + B ≤ 14 will be an equality.


L = (10A - 0.5A^2) + (5B - 0.25B^2) - λ(A + B - 14)
∂L/∂A = 0 = 10 - A - λ
∂L/∂B = 0 = 5 - 0.5B - λ
∂L/∂λ = 0 = A + B - 14


From these equations, A = 8; B = 6; λ = 2, the marginal income gain for a unit change in the 14 available resources.
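
The interpretation of λ as a marginal gain can be illustrated by solving the first-order conditions for a slightly larger resource total. This is a minimal sketch assuming the income functions of the exercise; the helper names are hypothetical, and the formulas hold only while the constraint is binding (total no greater than 20).

```python
def solve(total):
    """Solve 10 - A - lam = 0, 5 - 0.5B - lam = 0, A + B = total."""
    # Substituting A = 10 - lam and B = 10 - 2*lam gives 20 - 3*lam = total
    lam = (20.0 - total) / 3.0
    return 10.0 - lam, 10.0 - 2.0 * lam, lam

def income(a, b):
    return (10.0 * a - 0.5 * a ** 2) + (5.0 * b - 0.25 * b ** 2)

a, b, lam = solve(14.0)
a2, b2, _ = solve(14.01)

# Gain in total income per unit of extra resource, estimated by a small increase
marginal_gain = (income(a2, b2) - income(a, b)) / 0.01

print(a, b, lam, round(income(a, b), 2), round(marginal_gain, 3))
```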


12. Dealing with Uncertainty 


1. You have a job that requires you to be protected some of the time. The probability
that the needed hours of protection, P, will be less than p is 0.2p - 0.01p^2.
The cost of protection is $50 each hour. What is the expected daily cost for
your protection?


The cumulative distribution 0.2p - 0.01p^2 equals 1 when p = 10.
The probability density of P must be d(0.2p - 0.01p^2)/dp
= 0.2 - 0.02p for 0 ≤ p ≤ 10.
The area under this density = 1 when integrated from p = 0 to p = 10.
One minus the cumulative distribution (1 - 0.2p + 0.01p^2)
is the exceedance distribution.
Area under the exceedance distribution is the mean of P.


Expected cost = $50 times mean P.
Expected cost from protection = $50 ∫_0^10 {1 - (0.2p - 0.01p^2)} dp
= $50(10 - 10 + 10/3), where 10/3 = 3.33 is the mean value of P,
= $166.67 per day.
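
The claim that the area under the exceedance distribution equals the mean can be checked by integrating both ways. The sketch below uses a simple midpoint rule; the `integrate` helper is an illustrative assumption, not something from the original solution.

```python
def integrate(f, a, b, n=100000):
    """Midpoint-rule approximation of the integral of f from a to b."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def exceedance(p):
    """Probability that the hours needed exceed p, for 0 <= p <= 10."""
    return 1.0 - (0.2 * p - 0.01 * p ** 2)

def density(p):
    """Derivative of the cumulative distribution."""
    return 0.2 - 0.02 * p

mean_from_exceedance = integrate(exceedance, 0.0, 10.0)
mean_from_density = integrate(lambda p: p * density(p), 0.0, 10.0)
expected_cost = 50.0 * mean_from_exceedance

print(round(mean_from_exceedance, 4), round(mean_from_density, 4), round(expected_cost, 2))
```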


2. Probability of being flooded. 


A flood that is expected to be exceeded once in n years on average is
called the n-year flood. What is the probability of observing at least one 100-year
flood or greater over a 30-year period, assuming annual floods (maximum flows in
a year) are independent events?


Solution: Find 1 - probability of not being flooded in 30 years.
Probability = 1 - (1 - 1/n)^30. If n = 100, the
probability of seeing at least one flood
exceeding the 100-year flood is 1 - (1 - 1/100)^30
= 1 - 0.99^30 = 0.26.
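
The same expression applies to any return period and planning horizon. A minimal sketch, assuming independent years as stated in the exercise (the function name is illustrative):

```python
def prob_at_least_one(return_period, years):
    """Probability of at least one exceedance of the n-year flood in the given number of years."""
    p_no_exceedance_in_a_year = 1.0 - 1.0 / return_period
    return 1.0 - p_no_exceedance_in_a_year ** years

p_100yr_in_30 = prob_at_least_one(100, 30)
print(round(p_100yr_in_30, 4))
```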


3. State Lottery 


You are asked to establish a State lottery where the cost per ticket is $1. Each 
ticket has a 3-digit number; each number is equally likely. Owners of winning 
tickets receive $500 for each winning ticket. 

Suppose you buy 1 ticket a week for an entire year, i.e., 52 tickets.


(a) Show how to calculate the probability that you will win one or more lotteries 
in the year. (The answer is 0.0507.) 


Probability of winning on any week with any number is 1/1000.
Probability of losing every time = (1 - 1/1000)^52


1 - (999/1000)^52 = 0.0507 = probability of winning at least 1.


(b) If the lottery sells 1,000,000 tickets this week, what is the expected income
to the State? Note: The expected income of 1 million tickets is the expected
income from one ticket times 1 million.


Solution:
Expected income of each ticket = -499(0.001) + 1(0.999) = 0.500
Thus for 1,000,000 tickets, expected income = $0.50(1,000,000)
= $500,000


(c) Show how to calculate the variance of this income.


Solution:
Variance = [(-499 - 0.5)^2](0.001) + [(1 - 0.5)^2](0.999) = 249.75
Standard deviation = (249.75)^0.5 = 15.8
Variance of 1,000,000 sales = 1,000,000(249.75); Std. Dev. = 15.8*1000 = 15,800
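
The three lottery calculations can be reproduced in a few lines. This sketch assumes, as the solution does, that tickets are independent so that means and variances add; the variable names are illustrative.

```python
# (State income per ticket, probability): pay out $500 on a $1 ticket, or keep the $1
outcomes = [(-499.0, 0.001), (1.0, 0.999)]

mean = sum(value * prob for value, prob in outcomes)
variance = sum((value - mean) ** 2 * prob for value, prob in outcomes)

tickets = 1_000_000
total_mean = tickets * mean
total_variance = tickets * variance        # variances of independent tickets add
total_std_dev = total_variance ** 0.5

# Part (a): chance of winning at least once with 52 weekly tickets
p_win_in_year = 1.0 - (1.0 - 1.0 / 1000.0) ** 52

print(round(mean, 3), round(variance, 2), round(total_mean), round(total_std_dev))
print(round(p_win_in_year, 4))
```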


4. Book sale 


Twice a year a town has a used book sale, and at the end of the sale they offer any 
book they have for $1. The cost of handling books is estimated to be about $0.65 
per book. How many books should they have available to maximize their expected 
net revenue from the sale? 

Past sales indicate that the probabilities of various ranges of books being 
demanded is as follows: 


Hundreds    Probability                  Average
of books    of demand      Exceedance    Pr(exceedance)

0-2         0              1             1
2-4         0.1            1-0.9         0.95
4-6         0.4            0.9-0.5       0.7
6-8         0.4            0.5-0.1       0.3
8-10        0.1            0.1-0         0.05
10-12       0              0             0


Maximize NetBenefits
NetBenefits = Ben - Cost;
Cost = C*X; C = 65;
Ben = 100*(xb1 + xb2*(1 + .9)/2 + xb3*(.9 + .5)/2
+ xb4*(.5 + .1)/2 + xb5*(.1/2));
xb1 + xb2 + xb3 + xb4 + xb5 = X;
xb1 ≤ 2; xb2 ≤ 2; xb3 ≤ 2; xb4 ≤ 2; xb5 ≤ 2.


Variable      Value
NetBenefits   140.0000
Ben           530.0000
Cost          390.0000
X             6.000000 hundred books
xb1           2.000000
xb2           2.000000
xb3           2.000000
xb4           0.000000
xb5           0.000000
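
Because the average exceedance probability falls from one block of 200 books to the next, the model above can also be solved by stocking blocks in order until the expected revenue of the next block drops below its handling cost. The sketch below takes that greedy shortcut, which is an illustrative alternative to the optimization run and relies on the block probabilities being non-increasing.

```python
price_per_hundred = 100.0     # $1 per book
cost_per_hundred = 65.0       # $0.65 handling cost per book
block_size = 2.0              # each block covers 200 books (2 hundreds)

# Average probability that demand reaches each successive block of 200 books
avg_exceedance = [1.0, 0.95, 0.7, 0.3, 0.05]

stock = 0.0
benefit = 0.0
for pr in avg_exceedance:
    # Stock the block only if its expected revenue per hundred exceeds the cost
    if price_per_hundred * pr <= cost_per_hundred:
        break
    stock += block_size
    benefit += price_per_hundred * pr * block_size

cost = cost_per_hundred * stock
net_benefit = benefit - cost
print(stock, round(benefit, 2), round(cost, 2), round(net_benefit, 2))
```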




5. Bake sale 


The mayor is considering having a $100-dollar a plate dinner to increase the funds 
available for the homeless. Her problem is that she doesn’t know how many people 
might come. Experience suggests that it largely depends on whether it rains or not. 

The local weather service has indicated that the probability of a dry day is 0.70. 

Invitations must be sent out two weeks in advance. 

If it doesn’t rain there is an 80% chance that 500 people will attend, and a 20% 
chance that only 300 will attend (just to make it simple). If it rains, there is a 
60% chance that 350 will attend and a 40% chance that only 200 will attend. Each 
dinner ordered in advance costs $20. Everyone that comes must be served dinner. 
If additional dinners must be ordered because of a shortage, they cost $30 each. 


(a) How many dinners should the mayor order in advance of knowing how many 
will attend the dinner? 

(b) What is the maximum amount the mayor would be willing to pay for a weather 
forecaster that could predict for certain whether or not rain would occur on a 
particular day? The date of the dinner could then be set after such a forecast 
is made. 


Probability of rain = 30%. 

Let X = number of dinners ordered in advance (at a cost of $20 each). 

Let A = number of additional dinners ordered to make up demand. (at a cost of 
$30 each.) 

Let E = excess dinners not used. (at a cost of $20 each) 


Define Demand(i) = X + a(i) - e(i) for each possible outcome i.
500 = X + a1 - e1 with joint probability 0.7 (dry weather) * 0.8 (500 will attend) = 0.56
300 = X + a2 - e2 with joint probability 0.7 (dry) * 0.2 (300) = 0.14
350 = X + a3 - e3 with joint probability 0.3 (wet) * 0.6 (350) = 0.18
200 = X + a4 - e4 with joint probability 0.3 (wet) * 0.4 (200) = 0.12


Joint probability of outcome i
= Probability of weather * Probability of attendance | weather

The model objective: Maximize expected net income =
(100 - 20)X + [(100 - 30)a1 - (80 + 20)e1]0.56
            + [(100 - 30)a2 - (80 + 20)e2]0.14
            + [(100 - 30)a3 - (80 + 20)e3]0.18
            + [(100 - 30)a4 - (80 + 20)e4]0.12


Note: X is hopefully sold and is assumed to be in the first term of the objective
function. If less than X is sold, the excess is e(i), and the $80 profit on each excess
dinner, which the first term includes on the assumption that all X are sold, must be subtracted
from the objective function.

Solution: 


Global optimal solution 


Objective value : 31380.00 


Variable Value 

X 350.0000 
a(1) 150.0000 
e(1) 0.000000 
a(2) 0.000000 
e(2) 50.00000 
a(3) 0.000000 
e(3) 0.000000 
a(4) 0.000000 
e(4) 150.0000 


The maximum amount the mayor would be willing to pay for a weather forecaster
that could predict if rain would occur on a particular day would be the difference between
the expected income resulting from a dry day and that of the previous solution based on
expected weather values. To determine this value, maximize expected income from
dinners assuming no rain and subtract the expected net income without forecasting
as obtained from the above model. That difference in expected income is the most
she would be willing to pay for perfect weather forecasting.


Maximize (100 - 20)*X + (70*a1 - 100*e1)*0.8
+ (70*a2 - 100*e2)*0.2.


Objective value : 36000.00 


Variable Value 

X 500.0000 
a(1) 0.000000 
e(1) 0.000000 
a(2) 0.000000 
e(2) 200.0000 
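
Both dinner models can be checked by direct enumeration of the advance order X. This is a minimal sketch assuming the prices, costs, and probabilities stated in the exercise; it replaces the linear program with a search over whole numbers of dinners, and the function names are illustrative.

```python
def expected_net_income(x, scenarios):
    """Expected net income when x dinners are ordered in advance."""
    total = 0.0
    for demand, prob in scenarios:
        extra = max(demand - x, 0)            # dinners ordered late at $30 each
        total += prob * (100 * demand - 20 * x - 30 * extra)
    return total

def best_order(scenarios, max_order=600):
    """Search whole-number advance orders for the largest expected net income."""
    x = max(range(max_order + 1), key=lambda n: expected_net_income(n, scenarios))
    return x, expected_net_income(x, scenarios)

# (attendance, joint probability) with uncertain weather
all_weather = [(500, 0.56), (300, 0.14), (350, 0.18), (200, 0.12)]
# (attendance, probability) when the day is known to be dry
dry_only = [(500, 0.8), (300, 0.2)]

x_all, value_all = best_order(all_weather)
x_dry, value_dry = best_order(dry_only)
print(x_all, round(value_all, 2), x_dry, round(value_dry, 2), round(value_dry - value_all, 2))
```

Net income per outcome is written here as 100(demand) - 20X - 30a, which is algebraically the same as the 80X + 70a - 100e form used in the model once demand = X + a - e is substituted.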


In this case the most one would pay for perfect forecasting = 36,000 - 31,380 =
4,620.


6. Finding means, variances, medians.


For the following probability density functions, f_X(x), of a random variable X, integrate
them to find the equations for the cumulative distribution functions, F_X(x),
(ranging from 0 to 1), and the median, mean and variance of each of the distributions.
Finally, compute the area under the probability of exceedance function, 1 - F_X(x).


Uniform distribution. Since area under distribution is 1, f_X(x) = 0.1 for
5 ≤ x ≤ 15 and 0 otherwise.

F_X(x) = ∫_5^x 0.1 dx = 0.1x - 0.1(5) for 5 ≤ x ≤ 15

Thus F_X(x) = 0.1(x - 5) for 5 ≤ x ≤ 15, 0 for x < 5, and 1 otherwise.

[Figure: f_X(x) versus x, with the x-axis marked at 0, 5, 10, and 15.]

Median when F_X(x) = 0.5, so x = 10.

Mean = ∫_5^15 x f_X(x) dx = 0.05x^2 at x = 15 - 0.05x^2 at x = 5
= 0.05(15^2 - 5^2) = 10.
Variance = ∫_5^15 (x - 10)^2 f_X(x) dx = 0.1(x^3/3 - 10x^2 + 100x)
evaluated for x = 15 less its value for x = 5: 8.333
Area under Prob. of Exceedance, 1 - F_X(x):
5 + ∫_5^15 (1 - 0.1(x - 5)) dx = 5 + 1.5(15) - 15^2/20 - (1.5(5) - 5^2/20) = 10


Triangular distribution. Since area under distribution is 1, f_X(x) = 0.2 -
0.02(x - 5) = 0.3 - 0.02x for 5 ≤ x ≤ 15 and 0 otherwise.

Thus F_X(x) = 0.3x - 0.01x^2 - 1.25 for 5 ≤ x ≤ 15, 0 for x < 5, and 1 otherwise.

[Figure: f_X(x) versus x, with the x-axis marked at 0, 5, 10, and 15.]

Median when F_X(x) = 0.5, so x = 7.928932

Mean = ∫ x f_X(x) dx = ∫_5^15 x(0.2 - 0.02(x - 5)) dx =
(0.3x^2/2 - 0.02x^3/3) for x = 15 - (0.3x^2/2 - 0.02x^3/3) for x = 5
= 8.333333 = m

Variance = ∫ (x - m)^2 f_X(x) dx = ∫_5^15 (x^2 - 2mx + m^2)(0.3 - 0.02x) dx
= 0.1*x^3 - (0.02/4)*x^4 - 0.3*m*x^2 +
(2*m*0.02/3)*x^3 + (0.3*m^2)*x -
(0.01*m^2)*x^2
evaluated for x = 15 less its value for x = 5, which is 5.555

Area under Prob. of Exceedance, 1 - F_X(x):

5 + ∫_5^15 (1 - (0.3x - 0.01x^2 - 1.25)) dx = 5 + (0.01x^3/3 - 0.3x^2/2 + 2.25x)|_5^15 = 8.33


Triangular distribution. Since area under distribution is 1,
f_X(x) = 0.02(x - 5) = 0.02x - 0.1 for 5 ≤ x ≤ 15 and 0 otherwise.

Thus F_X(x) = 0.01x^2 - 0.1x + 0.25 for 5 ≤ x ≤ 15, 0 for x < 5, and 1 otherwise.
Median when F_X(x) = 0.5, so x = 12.07107

Mean = ∫ x f_X(x) dx = ∫_5^15 x(0.02(x - 5)) dx = ∫_5^15 (-0.1x + 0.02x^2) dx =
(0.02x^3/3 - 0.1x^2/2) for x = 15 - (0.02x^3/3 - 0.1x^2/2) for x = 5
= 11.667 = m

Variance = ∫ (x - m)^2 f_X(x) dx = ∫_5^15 (x^2 - 2mx + m^2)(0.02x - 0.1) dx
= 0.02*x^4/4 - 0.1*x^3/3 - 2*m*0.02*x^3/3 +
2*m*0.1*x^2/2 + 0.02*x^2*m^2/2 - 0.1*x*m^2
evaluated for x = 15 less its value for x = 5, which is 5.555
Area under Prob. of Exceedance, 1 - F_X(x):
5 + ∫_5^15 (1 - (0.01x^2 - 0.1x + 0.25)) dx = 5 + (0.75x - 0.01x^3/3 + 0.1x^2/2)|_5^15 = 11.667


Triangular distribution. Since area under each half of the distribution is 0.5,
f_X(x) = 0.04(x - 5) or 0.04x - 0.2 for 5 ≤ x ≤ 10 and 0.2 - 0.04(x - 10) or 0.6 - 0.04x
for 10 ≤ x ≤ 15 and 0 otherwise.

Thus F_X(x) = 0 for x < 5; 0.02x^2 - 0.2x + 0.5 for 5 ≤ x ≤ 10, and
0.6x - 0.02x^2 - 3.5 for 10 ≤ x ≤ 15, and 1 for x ≥ 15.

Median when F_X(x) = 0.5, hence x = 10.

Mean = ∫ x f_X(x) dx = ∫_5^10 x(0.04x - 0.2) dx + ∫_10^15 x(0.6 - 0.04x) dx =
-0.1(10)^2 + 0.04(10)^3/3 + 0.1(5)^2 - 0.04(5)^3/3 + 0.3(15)^2 - 0.04(15)^3/3 -
(0.3(10)^2 - 0.04(10)^3/3) = 10 = m

Variance = ∫ (x - m)^2 f_X(x) dx = ∫_5^10 (x^2 - 2mx + m^2)(0.04x - 0.2) dx +
∫_10^15 (x^2 - 2mx + m^2)(0.6 - 0.04x) dx =
∫_5^10 (0.04x^3 - 2(0.04)mx^2 + 0.04m^2x - 0.2x^2 + 0.4mx - 0.2m^2) dx +
∫_10^15 (0.6x^2 - 2(0.6)mx + 0.6m^2 - 0.04x^3 + 2m(0.04)x^2 - 0.04m^2x) dx =
(0.01x^4 - 2(0.04/3)mx^3 + 0.02m^2x^2 - 0.2x^3/3 + 0.2mx^2 - 0.2m^2x) [x = 10]
- (0.01x^4 - 2(0.04/3)mx^3 + 0.02m^2x^2 - 0.2x^3/3 + 0.2mx^2 - 0.2m^2x) [x = 5] +
(0.2x^3 - 0.6mx^2 + 0.6m^2x - 0.01x^4 + 2m(0.04)x^3/3 - 0.02m^2x^2) [x = 15]
- (0.2x^3 - 0.6mx^2 + 0.6m^2x - 0.01x^4 + 2m(0.04)x^3/3 - 0.02m^2x^2) [x = 10] = 4.1667

Area under Prob. of Exceedance, 1 - F_X(x):
5 + ∫_5^10 (1 - (0.02x^2 - 0.2x + 0.5)) dx + ∫_10^15 (1 - (0.6x - 0.02x^2 - 3.5)) dx =
5 + (0.5x + 0.1x^2 - 0.02x^3/3)|_5^10 + (4.5x - 0.3x^2 + 0.02x^3/3)|_10^15 = 10
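
The means, variances, and medians of the four distributions can be verified by numerical integration instead of by hand. The following is a minimal sketch using a midpoint rule and bisection; the helper names and the dictionary labels are illustrative and do not come from the original solution.

```python
def integrate(f, a, b, n=20000):
    """Midpoint-rule approximation of the integral of f from a to b."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Density functions on 5 <= x <= 15, as defined in the four cases above
densities = {
    "uniform": lambda x: 0.1,
    "decreasing": lambda x: 0.3 - 0.02 * x,
    "increasing": lambda x: 0.02 * x - 0.1,
    "symmetric": lambda x: 0.04 * x - 0.2 if x <= 10 else 0.6 - 0.04 * x,
}

def summarize(f, lo=5.0, hi=15.0):
    """Return (mean, variance, median) of the density f on [lo, hi]."""
    mean = integrate(lambda x: x * f(x), lo, hi)
    variance = integrate(lambda x: (x - mean) ** 2 * f(x), lo, hi)
    a, b = lo, hi
    for _ in range(40):                       # bisection on the cumulative distribution
        mid = 0.5 * (a + b)
        if integrate(f, lo, mid, 2000) < 0.5:
            a = mid
        else:
            b = mid
    return mean, variance, 0.5 * (a + b)

results = {name: summarize(f) for name, f in densities.items()}
for name, (mean, variance, median) in results.items():
    print(name, round(mean, 3), round(variance, 3), round(median, 3))
```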


7. Swimming 


Assume the admission to a public outdoor swimming pool in an urban area costs
$5 per person. Also assume the probability distribution of tickets sold per hour is
uniform from 5 to 15 (as shown above in question 6). Find the expected revenue
per hour. (You should be able to guess at the expected number of people buying
tickets and that times $5 will be the expected revenue.)

Assume the number of tickets sold/hour can range from 5 to 15. 


Probability of exceedance = 1 for x < 5,
= (1 - 0.1(x - 5)) for 5 ≤ x ≤ 15,
= 0 for x > 15.


Hence expected revenue =

$5(5) + $5 ∫_5^15 (1 - 0.1(x - 5)) dx = 25 + 5[(1.5x - 0.1x^2/2)]_5^15

= 25 + 5*((1.5*15 - 0.1*15^2/2) - (1.5*5 - 0.1*5^2/2)) = 50
If you have only x tickets to sell, expected income is $5x
if x ≤ 5, and 25 + 5*((1.5*x - 0.1*x^2/2) - (1.5*5 - 0.1*5^2/2)) for 5 ≤ x ≤ 15.


8. Planning a Park 


A recreational park is being planned. It borders a lake. Planners need to decide at
what lake level to build the recreational facilities such as docks, boat landings, picnic
benches, tables, fireplaces, restrooms, etc. The potential benefits derived from
these facilities increase with increasing lake level elevations due to the increasing
shore-line perimeter (length) and flatter areas to develop.

The developers assume the marginal benefits obtained will equal $5 per unit
target level if the actual lake is at that target level. But the lake level varies over
the recreational season. No matter what level is chosen as a target level for development,
the actual level will likely differ. The developers estimate there will be a
loss of $7.5 per unit deficit (difference between target level and lower actual level)
or a loss of $1 per unit excess if the lake level is above the target level.

For example, if the target level is 5, but the actual level is 4, the net income
will be $5(5) - (5 - 4)7.5 = 17.5. If the actual level is 6, the net income will be
$5(5) - 1(6 - 5) = 24.

The probability distribution of lake levels during the recreational season varies 
over a range of 0 to 10 units uniformly. What level within that range from 0 to 10 
should be the target level that maximizes expected net income? 

Discuss a modeling approach you would use to find the best value of the target 
level, and demonstrate its use. 


Solution suggestions: 
Simulation by trial and error: 
Pick a target level between 0 and 10. 


For each successive period t, generate a probability P(t) uniformly distributed from 
0 to 1. This can represent a value of the cumulative distribution of the uniform 
probability distribution from 0 to 10. 

In Excel you can use the expression 10*RAND() to generate uniformly distributed
lake levels from 0 to 10. Then depending on whether the level is below or above the
target, compute the net benefits, e.g., using the Excel functions IF or MAX. From
multiple samples compute the expected benefits associated with that target level. Do
this for various target levels and select the best.

For example: Select a target T. Generate a value of x using x = 10*RAND().
Net benefits = 5(T) - E[Losses]


Losses = IF(x < T, 7.5*(T - x), 1*(x - T))
Compute E[Losses] by dividing the sum of all Losses by the number of samples of x.


Use of optimization:


Alternatively, you can define an optimization model having the objective of
maximizing the expected net benefits.

Max 5*target - ELoss.

The expected losses, ELoss, will involve the sum of two integrals, one for finding
the expected losses associated with lake level deficits in the range from 0 to the
unknown target, and the other for finding the expected losses from lake level excesses
in the range from the target to 10. They must be integrated before using Excel to find
the best value of the target.

Solution: Let target = T. The probability density function is f(L) = 1/10 from 0 to
10.

ELoss = ∫ from 0 to T: 7.5(T - L) f(L) dL + ∫ from T to 10: 1(L - T) f(L) dL

ELoss = ∫ from 0 to T: 7.5(T - L)(1/10) dL + ∫ from T to 10: 1(L - T)(1/10) dL

ELoss = (7.5/10)[TL - L^2/2] from 0 to T + 0.1[L^2/2 - TL] from T to 10

This results in a function of T, the target, {5T - ELoss}, that can be maximized.

Let t be the target level.

Maximize 5*t - ELoss

ELoss = 7.5*0.1*((t*t - t^2/2) - (t*0 - 0^2/2)) + 1*0.1*((10^2/2 - 10*t)
- (t^2/2 - t*t))

(5 + 1)/(7.5 + 1) = P = value of cumulative distribution at optimal target t.


Objective value: 16.17647 


Variable Value 


t 7.058824 
ELOSS 19.11765 
P 0.7058824 


The value of the cumulative distribution function, F_X(t), associated with the
optimal value of the target t, depends on the slopes of the benefit and loss functions
only and not the distribution function.

If the lake level range were other than from 0 to 10, or if the probability distribution
were not uniform, the value of P would not change, and for any distribution, P
would define the cumulative distribution at the target value.
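
The optimal target and the ratio P can be confirmed with a short script. The sketch below evaluates the integrated expected-loss expression on a grid of target levels; the function signature and the grid search are illustrative assumptions, standing in for the Excel Solver step described above.

```python
def expected_net_income(t, lo=0.0, hi=10.0, benefit=5.0, deficit=7.5, excess=1.0):
    """Expected net income for target t with lake levels uniform on [lo, hi]."""
    density = 1.0 / (hi - lo)
    loss_deficit = deficit * density * (t - lo) ** 2 / 2.0   # levels below the target
    loss_excess = excess * density * (hi - t) ** 2 / 2.0     # levels above the target
    return benefit * t - loss_deficit - loss_excess

# Grid search over target levels 0.000, 0.001, ..., 10.000
best_t = max((10.0 * i / 10000 for i in range(10001)), key=expected_net_income)
best_value = expected_net_income(best_t)

# Ratio of slopes: (benefit + excess loss) / (deficit loss + excess loss)
p = (5.0 + 1.0) / (7.5 + 1.0)

print(round(best_t, 3), round(best_value, 4), round(p, 4))
```

For the uniform distribution on 0 to 10, the cumulative probability at the best target is simply `best_t / 10`, which should agree with `p`.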


9. Birthday problem


What is the probability P of at least two in a group of n people having the same
birthday (month and day)? Write the expression for P.

Solution: P = 1 - probability of n people all having different birthdays.

Probability of 2 people having different birthdays = probability of the second person
having a different birthday than the first = 364/365, thus

probability of 2 people having the same birthday = 1 - 364/365 = 1/365.

Probability of 3 people having different birthdays = probability of the first 2 not having
the same birthday, 364/365, times the probability of the third not having a birthday on either of the
2 days the other two have them, 363/365.

Thus, the probability of at least 2 of the 3 having the same birthday =
1 - (364/365)(363/365).

Probability of 4 people having different birthdays = probability of the first 2 not having
the same birthday, 364/365, times the probability of the third not having a birthday on either of
the 2 days the other two have them, 363/365, times the probability of the fourth not having
a birthday on any of the 3 days the other three have them, 362/365.


Thus, the probability of at least 2 of the 4 having the same birthday =
1 - (364/365)(363/365)(362/365).


In general:

P = 1 - Π_{i=2}^{n} [365 - (i - 1)]/365
2 


Solutions:

n     P
20    0.41
25    0.57
30    0.71
40    0.89
50    0.97
60    0.994
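
The general product expression is easy to evaluate for any group size. A minimal sketch, assuming 365 equally likely birthdays (the function name is illustrative):

```python
def prob_shared_birthday(n, days=365):
    """Probability that at least two of n people share a birthday."""
    p_all_different = 1.0
    for already_placed in range(1, n):
        # The next person must avoid the birthdays of everyone placed so far
        p_all_different *= (days - already_placed) / days
    return 1.0 - p_all_different

for n in (20, 25, 30, 40, 50, 60):
    print(n, round(prob_shared_birthday(n), 3))
```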
10. Heart Attacks 


Serious heart attacks occur in a county on average once every two weeks, but they 
are random. How many heart attacks should the physicians expect to respond to 
in a single year, on average? 


Obviously 26, since there are 26 two-week periods in a year.


What is the probability that at least two heart attacks will occur on the same 
day? 

Suppose the ith heart attack occurs on day Di, one of the 365 days of the year.
There are 365 possibilities. There are 365^2 possible combinations of days on which
two heart attacks might occur. There are 365^n possible combinations, or
sequences, of days on which n heart attacks might occur.

The probability that 2 heart attacks will occur on different days is 364/365. The
probability that 3 heart attacks will occur on different days is (364/365)(363/365).
The probability that n > 1 heart attacks will occur on different days is


(364/365)(363/365) ... (365 - n + 1)/365 = 365!/[365^n (365 - n)!]


When n = 26, probability = 0.40 = probability that no two of the 26 heart attacks
will occur on the same day.

Thus, there is a

1 - probability that all heart attacks will occur on different days = 1 - 0.4 = 0.6
chance that 2 or more heart attacks may occur on the same day.


11. Taxicab problem 


Three taxi stands that are serviced by taxi company: Sites A, B, and C. 
Three policies have been tested but not analyzed: 


Policy 1: cruise around the site and pick up first person wanting a ride. 
Policy 2: return to nearest taxi stand and wait for rider. 
Policy 3: wait at nearest site for radio call. (Not available at B) 


Questions: 


• What is the best policy at each site?

• Given the best policy, what is the probability of being at each site?

• Given the best policy, what is the expected net income from each rider picked up at
each site?

• What is the overall expected net income per rider?


To answer the questions, you will need data. 
Data: 
Average costs, Cik, of policy k at site i and resulting trip counts:


Site i  Policy k  Cik   No. of trips to site j       Probabilities Pijk = P(j | i, k)
                        A     B     C     Total      A       B       C
A       1         3     36    18    18    72         0.5     0.25    0.25
        2         5     4     48    12    64         1/16    0.75    3/16
        3         9     8     4     20    32         0.25    1/8     5/8
B       1         1     45    0     45    90         0.5     0       0.5
        2         6     5     70    5     80         1/16    7/8     1/16
C       1         2     15    15    30    60         0.25    0.25    0.5
        2         4     8     48    8     64         1/8     0.75    1/8
        3         5     36    3     9     48         0.75    1/16    3/16

Average travel costs, TCij, between sites i and j:


Site i   Site j   TCij
A        A        1
A        B        4
A        C        7
B        A        4
B        B        2
B        C        5
C        A        7
C        B        5
C        C        2


Average income, Yijk, costs, TCij and Cik, and return, Rijk, by site i, policy k, and
destination j (Yijk values not shown):


Site i   Policy k   Site j   TCij   Cik   Rijk
A        1          A        1      3     10
A        2          A        1      5     8
A        3          A        1      9     4
A        1          B        4      3     4
A        2          B        4      5     2
A        3          B        4      9     6
A        1          C        7      3     8
A        2          C        7      5     4
A        3          C        7      9     4
B        1          A        4      1     14
B        2          A        4      6     8
B        1          B        2      1     0
B        2          B        2      6     16
B        1          C        5      1     18
B        2          C        5      6     8
C        1          A        7      2     10
C        2          A        7      4     6
C        3          A        7      5     4
C        1          B        5      2     2
C        2          B        5      4     4
C        3          B        5      5     0
C        1          C        2      2     8
C        2          C        2      4     2
C        3          C        2      5     8

LP Model:


Maximize Σ_i Σ_k Σ_j Pik Pijk Rijk

Subject to:
Pj = Σ_i Σ_k Pik Pijk    for j = A, B, C
Σ_k Pik = Pi             for i = A, B, C
Σ_i Pi = 1.


Solution: Objective value 13.34 = expected return per trip 


Variable   Value

P11        0.000000
P12        0.06722689   indicates if at site A follow policy 2
P13        0.000000

P21        0.000000
P22        0.8571429    indicates if at site B follow policy 2

P31        0.000000
P32        0.07563025   indicates if at site C follow policy 2
P33        0.000000

P1         0.06722689
P2         0.8571429
P3         0.07563025


Steady-state probabilities of being at each site: 


State A : 0.0672 = Pa = Pa2 
State B : 0.8571 = Pb = Pb2 
State C : 0.0756 = Pc = Pc2 


Expected gains given state and best policy:
g(a) = Σ_j Paj2 Raj2 = (1/16)8 + (0.75)2 + (3/16)4 = 2.75
g(b) = Σ_j Pbj2 Rbj2 = (1/16)8 + (7/8)16 + (1/16)8 = 15.0
g(c) = Σ_j Pcj2 Rcj2 = (1/8)6 + (0.75)4 + (1/8)2 = 4.0

Overall maximum expected return =

Σ_i Pi g(i) = 0.0672 g(a) + 0.8571 g(b) + 0.0756 g(c)
= 0.0672(2.75) + 0.8571(15) + 0.0756(4) = 13.34
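
Given that policy 2 is followed at every site, the steady-state probabilities and the overall expected return can be reproduced without a linear program. The sketch below iterates the transition probabilities until they settle; it only evaluates the reported policy and does not by itself show that policy 2 is the best choice. The matrix layout and names are illustrative.

```python
sites = ["A", "B", "C"]

# Transition probabilities P(j | i, policy 2) and returns R(i, j, policy 2)
P = [
    [1 / 16, 3 / 4, 3 / 16],   # from A
    [1 / 16, 7 / 8, 1 / 16],   # from B
    [1 / 8, 3 / 4, 1 / 8],     # from C
]
R = [
    [8, 2, 4],
    [8, 16, 8],
    [6, 4, 2],
]

# Repeatedly apply pi_j = sum_i pi_i * P[i][j], starting from equal probabilities
pi = [1 / 3, 1 / 3, 1 / 3]
for _ in range(200):
    pi = [sum(pi[i] * P[i][j] for i in range(3)) for j in range(3)]

# Expected gain per trip from each site, and the overall expected return
gains = [sum(P[i][j] * R[i][j] for j in range(3)) for i in range(3)]
overall = sum(pi[i] * gains[i] for i in range(3))

print([round(p, 4) for p in pi], gains, round(overall, 2))
```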


12. Public library 


A town’s public library needs more space. Recently the town had to decide whether 
to relocate or renovate their public library. The old, and now empty, Woolworth 
Store was a potential new location. A Foundation indicated they would give the 
town $2.5 million if they immediately chose the Woolworth Store. This gift would 
help pay the estimated relocation cost of $9.5 million. It was not clear that the 
Foundation would give the $2.5 million to the town if the town chose to renovate 
the existing library or to delay the relocation decision to first determine if the 
Woolworth Store could be rented. 

The debate over what to do centered on the question of whether the Woolworth 
Store could be rented, and hence generate tax revenue for the town. If the library 
were moved to the old store, there would be no tax revenue derived from that 
store but there would be some income derived from the sale of the existing library 
building—if they could sell it. 

Assume that when the Foundation made the offer, you were asked to help the 
town decide what to do. 

You reason the town has some choices: It could decide to move its public 
library to the old Woolworth Store, or it could hire a consultant to evaluate the 
suitability of that store for another business and to obtain a better estimate of the 
likely income from the sale of the existing library building. If the town decides to 
move the library, the Woolworth relocation cost would be $7 million ($9.5 million 
less the Foundation gift of $2.5 million) and take two years. If the town hires a 
consultant, the consultant will charge the town $100,000 and require 6 months to 
make a recommendation. The benefits of a relocated or renovated library would 
be delayed by the additional 6 months required by the consultant. 

If the consultant is hired and indicates the old Woolworth Store has no commercial
value, then the relocation process could take place immediately, at a cost of $7
million or $9.5 million, depending on whether the Foundation gives the town $2.5 
million, less the expected income from the sale of the existing library building. On 
the other hand, if the consultant indicates the old store has commercial value, the 
town could act immediately to renovate the existing library, or it could wait and 
try to rent the store over the coming year. If after a year the store is not rented, 
the town would relocate the library. The relocation costs and time remain the same 
as before: $7 million or $9.5 million over two years, depending on whether the 
Foundation gives the town $2.5 million, less the expected income from the sale 
of the existing library. In addition, the benefits of not having a new facility are 
further delayed by the waiting period, say a year. 

Renovation of the existing library will take 2 years and cost $13.5 million
or $11 million, again depending on the Foundation’s $2.5 million gift decision,
less the expected capitalized tax revenues from the rental of the Woolworth Store
(considering the possibility that it might not be rented).

If the town waits to see if it can rent the store, and succeeds in renting the store, 
say in a year, then it can begin the renovation of the existing library, again at a 
cost of $13.5 million or $11 million, depending on the Foundation’s $2.5 million 
gift decision, plus the lost benefits to the library users of delaying another year, 
less the capitalized (present value of the) tax revenues from renting the Woolworth 
Store. 

Show how you would determine how to advise the town. Should the town 
relocate its library now or hire a consultant? What are your decision criteria? 
What probabilities do you need to estimate to answer this question? What other 
assumptions do you have to make? How would you determine how sensitive your 
recommendation is to all those assumptions? 


Define: D = loss from use of new library per year of delay.
SV = income from sale of existing library times probability of it being sold.
PF(i) = probability of Foundation giving $2.5M under situation i.
PV = probability consultant indicates store has commercial value.
PR = probability of renting store.
CI = capitalized income (tax revenue) derived from store rental.


Decision tree:

Relocate now. Cost = $7M - SV

Hire consultant. Cost = $100,000 + 0.5D

    Store has no value: Relocate. Cost = (1 - PV)[$9.5M - 2.5M(PF(1)) - SV]

    Store has value:

        Don’t wait: Renovate. Cost = $13.5M - 2.5M(PF(2)) - CI(PR)

        Wait a year. Cost = D

            Store rented: Renovate. Cost = (PR)[$13.5M - 2.5M(PF(3)) - CI]

            Store not rented: Relocate. Cost = (1 - PR)[$9.5M - 2.5M(PF(4)) - SV]


Work backwards from each endpoint to get expected costs. Squares are decision 
points; circles are chance events. 

Perform sensitivity analyses on all assumed values, including probabilities, to see 
how sensitive your decision is to those assumed values. 


13. Immigrants


Suppose you are a designer of a facility to temporarily house immigrants entering
the country. The number of immigrants needing housing in the facility each week
varies. Data exist that allow you to calculate the probability distribution of the
number of people needing housing each week. Let P represent the discrete random
variable for the number of people needing housing, and Pr(p) be the probability
that P = p. The sum over all p of Pr(p) equals 1.

Your job is to determine the target population level of your new facility, realizing
that you may have more or less than that target level each week. Those running
the facility will get paid a certain amount based on both the target capacity of the
facility and the actual average number in the facility each week.

The revenue obtained from having an amount equal to the target population,
T, is defined by the concave function R(T) as shown below. (Note, if T were 20
and 20 people were housed, the benefits would equal -5 + 16(13) + 8(7). The -5
reflects fixed costs if the facility is built. If it is not built, T = 0 and R(T) = 0.)


[Figure: Revenue R(T) versus target population T.]


If the number of people in the facility is not equal to the target value T, there 
is a reduction in total net revenue. For each person less than the target, there is a 
loss of $21. For each person more than the target there is a loss of $3. 

The loss function is shown below. Note: Losses are a function of the deviations 
from the target population T and are independent of the value of the target value, 
T. 


[Figure: Loss versus population housed, with zero loss at the target T and a slope of $3 per person above T.]


So, suppose the T is 20 and the actual amount received is 15. The total net 
benefits would equal R(T) — 21(20-15) = -5 + 16(13) + 8(7) — 21(5). 

Develop a linear model that will find the value of the target number T that
maximizes the expected total net revenue. (Note: Total expected net revenue is
the revenue obtained from target T less the expected losses from deviations from
the target associated with each value p of P and its probability Pr(p).) Show the model needed
to determine the target T that maximizes total expected revenue.
Define non-negative deviations from the target T: D_i is the shortfall below T and 
E_i is the excess above T when P = p_i, so that 


p_i = T - D_i + E_i for all i. 


The probability of each D_i and E_i equals the probability Pr(p_i). The expected 
loss is therefore: 

Expected loss = Σ_i Pr(p_i) [21 D_i + 3 E_i] 

Revenue from target: R(T) = -5Z + 16 T1 + 8 T2 


T = T1 + T2 


T1 ≤ 13Z, T2 ≤ 99Z, Z is a 0,1 variable 


Thus, the LP model (strictly a mixed-integer model, because Z is binary): 

Maximize -5Z + 16 T1 + 8 T2 - Σ_i Pr(p_i) [21 D_i + 3 E_i] 
Subject to: 

T1 ≤ 13Z 

T2 ≤ 99Z (99 represents any number greater than each p_i.) 

Z is a 0,1 variable. 

T = T1 + T2 

p_i = T - D_i + E_i for all i. 
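
As a rough check on a model of this kind, the expected net revenue can also be evaluated by direct enumeration over candidate targets. The sketch below is illustrative only: the exercise gives no values for Pr(p), so the distribution `dist` is a purely hypothetical assumption, as are the function names.

```python
def revenue(T):
    """Concave revenue R(T): fixed cost 5, then 16 per person up to 13 and 8 beyond."""
    if T == 0:
        return 0.0          # facility not built
    return -5 + 16 * min(T, 13) + 8 * max(T - 13, 0)

def expected_net_revenue(T, dist):
    """R(T) minus the expected loss: 21 per person short of T, 3 per person over T."""
    loss = sum(pr * (21 * max(T - p, 0) + 3 * max(p - T, 0))
               for p, pr in dist.items())
    return revenue(T) - loss

# Hypothetical distribution Pr(p); these numbers are assumptions for illustration.
dist = {10: 0.2, 15: 0.3, 20: 0.3, 25: 0.2}

# Enumerate integer targets and keep the one with the largest expected net revenue.
best_T = max(range(0, 31), key=lambda T: expected_net_revenue(T, dist))
print(best_T, expected_net_revenue(best_T, dist))
```

With a single possible demand of 15 and T = 20, `expected_net_revenue(20, {15: 1.0})` reproduces the worked value -5 + 16(13) + 8(7) - 21(5).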


5. Licenses: 


The State allocates hunting licenses to a store that sells them for $100 each. The 
demand for licenses is uniformly distributed between 10 and 30. At least 10 will 
be demanded and at most 30 will be demanded at that store. 


(a) Define the expected income function associated with any allocation ‘x’ of 
hunting licenses. Sketch the function. 

(b) Assume there are two stores, but the demand distribution at the other store is 
uniform between 5 and 15. If only 25 licenses are to be allocated, how many 
licenses should be allocated to each store that will maximize total expected 
income? 


(a) Let w be the number of sales. 


Expected income = $5w   for w ≤ 5 

= $5 [5 + ∫_0^(w-5) (1 - 0.1u) du]   for 5 < w < 15 

= $5 [5 + ∫_0^10 (1 - 0.1u) du] = $50   for w ≥ 15 

[Figure: expected income plotted against demand.]


(b) Now assume there are two stores, and the demand distribution at the other store is 
uniform between 5 and 15. Only 25 licenses are to be allocated. To determine how 
many licenses to allocate to each store to maximize total expected income: 

Let x be the additional allocation to store 1 and y be the additional allocation to store 2. Just consider 
the remaining 10 licenses after 10 and 5 have been allocated to stores 1 and 2. 

Maximize 100 [∫_0^x (1 - u/20) du + ∫_0^y (1 - u/10) du] subject to x + y ≤ 10. Hence: 
maximize (x - x²/40 + y - y²/20) where x + y ≤ 10. 

Or equate the slopes: 1 - x/20 = 1 - y/10 where x + y = 10. Hence x/2 = 
(10 - x), or x = 10/1.5 = 20/3 = 6.667 and thus y = 3.333. Store 1 therefore 
receives 10 + 6.667 = 16.667 licenses and store 2 receives 5 + 3.333 = 8.333. 
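
The split of the remaining 10 licenses can be checked numerically. The following is a minimal sketch, assuming the objective x - x²/40 + y - y²/20 derived above; the function names and the grid resolution are illustrative choices, not part of the exercise.

```python
def expected_extra_sales(extra, width):
    """Expected sales from `extra` licenses beyond a store's minimum demand.

    Demand above the minimum is uniform on [0, width], so the u-th extra license
    sells with probability 1 - u/width; integrating gives extra - extra^2/(2*width).
    """
    extra = min(extra, width)
    return extra - extra ** 2 / (2 * width)

def total_income(x, price=100, remaining=10):
    """Income from the remaining licenses when store 1 gets x and store 2 gets the rest."""
    y = remaining - x
    return price * (expected_extra_sales(x, 20) + expected_extra_sales(y, 10))

# Grid search over the split of the remaining 10 licenses, in steps of 0.001.
grid = [i / 1000 for i in range(0, 10001)]
best_x = max(grid, key=total_income)
best_y = 10 - best_x
print(round(best_x, 3), round(best_y, 3))
```

The grid search should land close to the slope-equating answer x = 20/3, y = 10/3.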


13. Stochastic Processes 
• Weather prediction. 


The mayor is considering having a $100-a-plate dinner to increase the funds 
available for the homeless. His problem is that he doesn’t know how many people 
might come. Experience suggests that it largely depends on whether it rains or not. 

The probability of a dry day depends on the past day’s condition. The local 
weather service has provided the following conditional probabilities of dry and 
wet days: 


               Day t+1:  Dry    Wet 
Day t:   Dry             0.80   0.20 
         Wet             0.47   0.53 


Invitations must be sent out two weeks in advance. 

(a) What is the probability of the selected day being a dry one? 


Assuming a steady-state condition has been reached in two weeks, 
solve the following simultaneous equations: 
P(dry) = P(dry)0.8 + P(wet)0.47 and/or P(wet) = P(dry)0.2 + P(wet)0.53. 
P(dry) + P(wet) = 1. 
Solution: P(dry) = 0.47/0.67 = 0.70, P(wet) = 0.30. 
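
The steady-state assumption can be checked by propagating the transition probabilities forward 14 days. This is a small sketch using only the table above; the starting distribution (a certainly wet day) is an arbitrary assumption chosen to show that the starting state hardly matters after two weeks.

```python
# Transition probabilities from the table: P[today][tomorrow].
P = {"dry": {"dry": 0.80, "wet": 0.20},
     "wet": {"dry": 0.47, "wet": 0.53}}

def steady_state(P):
    """Two-state steady state: the flow dry->wet must equal the flow wet->dry."""
    p_dry = P["wet"]["dry"] / (P["wet"]["dry"] + P["dry"]["wet"])
    return {"dry": p_dry, "wet": 1.0 - p_dry}

def after_n_days(start, n, P):
    """Propagate a starting distribution n days forward."""
    dist = dict(start)
    for _ in range(n):
        dist = {s: sum(dist[r] * P[r][s] for r in P) for s in P}
    return dist

pi = steady_state(P)
two_weeks = after_n_days({"dry": 0.0, "wet": 1.0}, 14, P)
print(pi, two_weeks)
```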


(b) Should the guests be encouraged to bring an umbrella? For this problem make 
up convenience ‘benefits or costs’ for each possibility. For example, if it is dry 
and they do not bring an umbrella, or if it is wet and they bring an umbrella, 
the benefit can be 10. If it rains and they do not have an umbrella, the benefit 
is -10. Otherwise (dry, with an umbrella), it is -5. 


Let pdn be the probability of having no umbrella on a dry day, and pdy the probability 
of having an umbrella on a dry day. 

Define pwn and pwy similarly for wet days. Let pd be the probability of having a dry day and pw the 
probability of the day being wet. 


pd = (pdn + pdy)*.8 + (pwn + pwy)*.47; 
pw = (pdy + pdn)*.2 + (pwy + pwn)*.53; 


pd = pdy + pdn; 
pw = pwy + pwn; 
pd + pw = 1; 


maximize pdn*(10*.8 - 10*.2) + pdy*(-5*.8 + 10*.2) 
+ pwy*(-5*.47 + 10*.53) + pwn*(10*.47 - 10*.53); 


Objective value : 5.089552 


Variable Value Reduced Cost 
pd 0.7014925 0.0000000 
pdn 0.7014925 0.0000000 
pdy 0.0000000 15.00000 
pwn 0.0000000 20.00000 
pwy 0.2985075 0.0000000 
pw 0.2985075 0.0000000 


This shows that if the current state is dry, the best policy is not to bring an umbrella. If the current state is wet, 
the best policy is ‘Yes’, bring an umbrella. 
Consider using Dynamic Programming: 

[Figure: one stage of the dynamic programming network. From each state (Dry or Wet) the decisions Yes and No lead to the next day's Dry and Wet states, with the transition probabilities and benefits given above.]

Let F_t(i) be the maximum expected benefits given state i with t periods (stages) 
to go to the last stage. 

Assume F_0(i) = 0 for each state i. 

F_{t+1}(Dry) = max { Y: (F_t(Dry) - 5)0.8 + (F_t(Wet) + 10)0.2, N: (F_t(Dry) + 
10)0.8 + (F_t(Wet) - 10)0.2 }. 

F_{t+1}(Wet) = max { Y: (F_t(Dry) - 5)0.47 + (F_t(Wet) + 10)0.53, N: (F_t(Dry) + 
10)0.47 + (F_t(Wet) - 10)0.53 }. 

Successive values of F (Dry) and F (Wet) are shown below along with the optimal 
policy: 


Time t    F_t(Dry)     F_t(Wet) 

0         0            0 
1         6      N     2.95   Y 
2         11.4   N     7.33   Y 
3         16.6   N     12.20  Y 
4         21.7   N     17.20  Y 
5         26.8   N     22.26  Y 
6         31.89  N     27.35  Y 


Notice the successive differences F_{t+1}(state) - F_t(state) are converging on 5.09 
for both Dry and Wet states. This is the expected benefit per period also found using the linear 
programming model shown above. The optimal policies identified by the LP and DP 
models are the same. 
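
The recursion can be run for more stages than the table shows to watch the per-period gain settle. This is a minimal sketch, assuming the benefits made up in part (b); the dictionary layout and names are illustrative.

```python
P = {"Dry": {"Dry": 0.80, "Wet": 0.20}, "Wet": {"Dry": 0.47, "Wet": 0.53}}
# Benefit keyed by (umbrella decision, next day's weather), as assumed in part (b).
BENEFIT = {("Y", "Dry"): -5, ("Y", "Wet"): 10, ("N", "Dry"): 10, ("N", "Wet"): -10}

def backward_step(F):
    """One stage of the recursion: new values and the best decision for each state."""
    new_F, policy = {}, {}
    for state in P:
        options = {
            d: sum(P[state][nxt] * (F[nxt] + BENEFIT[(d, nxt)]) for nxt in P)
            for d in ("Y", "N")
        }
        policy[state] = max(options, key=options.get)
        new_F[state] = options[policy[state]]
    return new_F, policy

F = {"Dry": 0.0, "Wet": 0.0}
history = []
for t in range(1, 31):
    F_next, policy = backward_step(F)
    gain = F_next["Dry"] - F["Dry"]        # successive difference for the Dry state
    history.append((t, F_next["Dry"], F_next["Wet"], policy))
    F = F_next

for row in history[:6]:
    print(row)
print("per-period gain:", round(gain, 4))
```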


• Gambling 


You are given the opportunity to begin with an investment of $1 in a succession of 
gambles where in each iteration there is a 90% chance of doubling your money 
and a 10% chance of losing all your money. Hence if you win the first three gambles 
you will have $8. You can quit playing at any time. What are your expected 
earnings and the probability of having all of them for successive iterations, and 
when, and why, would you stop playing? 


Solution: The probability of winning n successive gambles = 0.9^n, 
and the earnings would be $2^n. 


Expected earnings = $2^n 0.9^n = $1.8^n. 


[Figure: two plots against the number of gambles n (1 to 10): the probability of winning every gamble, 0.9^n, and the expected earnings, 2^n*0.9^n.]
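
The two quantities in the solution can be tabulated directly. The sketch below only evaluates 0.9^n, 2^n and their product; the function name and the choice of ten gambles are illustrative.

```python
def gamble_table(max_gambles=10, p_win=0.9):
    """Rows of (n, probability of winning all n gambles, payoff if all won, expected earnings)."""
    rows = []
    for n in range(1, max_gambles + 1):
        prob_all = p_win ** n
        payoff = 2 ** n
        rows.append((n, prob_all, payoff, prob_all * payoff))
    return rows

rows = gamble_table()
for n, prob_all, payoff, expected in rows:
    print(f"n={n:2d}  Pr(win all)={prob_all:.3f}  payoff=${payoff:4d}  expected=${expected:7.2f}")
```

The table makes the tension visible: expected earnings grow as 1.8^n while the chance of still holding any money shrinks as 0.9^n.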


• Crime Reduction 


A community center provides recreation facilities for young people. Among the 
benefits to the community are lower crime rates. Assume there are two states of 
crime rates—low (L) and high (H). Observed crime rates over time show that if the 
crime rate is low in any month, the probability of having a low rate the following 
month is 0.5. The probability of having a high-rate month following a low-rate 
month is 0.5. If the crime rate is high in a month, the probability of a high rate 
the following month is 0.9, and thus the probability of a low rate the next month 
is 0.1. These probabilities apply if the community center does not advertise. This 
is the ‘do-nothing’ policy. (Policy n). These conditional probabilities are shown 
in Fig. 1. However, if the center advertises its recreation programs, (policy a) the 
conditional probabilities change to those shown in Fig. 2. 

The community center can change its policy at the beginning of each month. 
A high-crime month costs 20 more than a low-crime month, and advertising 
costs 10 per month. 

                 Month t+1                         Month t+1 
Policy n:        L     H         Policy a:         L     H 
Month t:   L     0.5   0.5       Month t:    L     0.8   0.2 
           H     0.1   0.9                   H     0.6   0.4 
Fig. 1                           Fig. 2 


Show how you would determine what policy to implement following each type 
of month (low or high crime rate) to minimize the total expected cost of crime and 
advertising expense. 

Hint: You can use the network below if you wish. Work backward. Stop when 
the minimum cost policies (decisions) remain the same in two successive months. 


[Figure: network linking states L and H in successive months under decisions n and a.]


You can use the network to solve for the steady-state policy that doesn’t change 
given the state (H or L) over time. Or you can use Excel to solve the dynamic programming 
problem represented by the network above, or a linear program where 
the variables are the joint probabilities of states and decisions. 

Let F(L,0), the least cost to continue 0 periods into the future from state L, equal 0. 

Let F(H,0), the least cost to continue 0 periods into the future from state H, equal 0. 

For t = 1, 2, ..., with policy n listed first and policy a second: 

F(L,t) = Min[ F(L,t-1)*0.5 + (F(H,t-1) + 20)*0.5 , F(L,t-1)*0.8 + (F(H,t-1) + 20)*0.2 + 10 ] 

F(H,t) = Min[ F(L,t-1)*0.1 + (F(H,t-1) + 20)*0.9 , F(L,t-1)*0.6 + (F(H,t-1) + 20)*0.4 + 10 ] 


           Policy n    Policy a    Min        Best policy    Increase from t-1 
F(L,1)     10          14          10         n              10 
F(H,1)     18          18          18         n, a           18 
F(L,2)     24          25.6        24         n              14 
F(H,2)     35.2        31.2        31.2       a              13.2 
F(L,3)     37.6        39.44       37.6       n              13.6 
F(H,3)     48.48       44.88       44.88      a              13.68 
F(L,4)     51.24       53.056      51.24      n              13.64 
F(H,4)     62.152      58.512      58.512     a              13.632 
F(L,5)     64.876      66.6944     64.876     n              13.636 
F(H,5)     75.7848     72.1488     72.1488    a              13.6368 

Solution: You should advertise if in state H. 
The increase per month is converging to 13.64. 


The expected minimum monthly cost is 13.64 and the policy is ‘n’ if in state L 
and ‘a’ in state H. 
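
The backward recursion for this problem is short enough to run as a script. The following is a sketch under the stated data (transition probabilities of Fig. 1 and Fig. 2, a cost of 20 for a high-crime month, 10 for advertising); the dictionary layout and names are illustrative assumptions.

```python
# P[policy][state][next state]; policy "n" = do nothing, "a" = advertise.
P = {"n": {"L": {"L": 0.5, "H": 0.5}, "H": {"L": 0.1, "H": 0.9}},
     "a": {"L": {"L": 0.8, "H": 0.2}, "H": {"L": 0.6, "H": 0.4}}}
HIGH_COST = 20                 # extra cost of a high-crime month
AD_COST = {"n": 0, "a": 10}    # monthly advertising cost per policy

def stage(F):
    """One backward step: least expected cost and best policy for each state."""
    new_F, best = {}, {}
    for s in ("L", "H"):
        cost = {
            pol: AD_COST[pol]
            + P[pol][s]["L"] * F["L"]
            + P[pol][s]["H"] * (F["H"] + HIGH_COST)
            for pol in ("n", "a")
        }
        best[s] = min(cost, key=cost.get)   # ties go to "n", the first policy listed
        new_F[s] = cost[best[s]]
    return new_F, best

F = {"L": 0.0, "H": 0.0}
for t in range(1, 6):
    F, best = stage(F)
    print(t, round(F["L"], 4), round(F["H"], 4), best)
```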


14. Chance Constrained and Monte Carlo Modeling 
1. Chance constraints and Monte Carlo simulation. 


Consider an “allocation problem,” but with chance constraints on meeting random 
demands Dj at demand sites j. For example, if the allocation Aj is to meet or 
exceed the demand Dj at site j at least 95% of the time, the chance constraint is: 


Pr{Aj ≥ Dj} ≥ 0.95 
The deterministic equivalent is 
Aj ≥ d_j^0.95, where d_j^0.95 is the demand that is exceeded only 5% of the time. 


Assume the cumulative distribution of demand d is d/(1 + d). This is the probability 
that the actual random demand will be less than or equal to d. When d is 0, the 
cumulative probability is 0: the probability is zero that the actual demand will 
be less than 0. As d increases, the probability that the random actual demand will 
be equal to or less than d approaches 1. Therefore d_j^0.95, the demand that will be 
exceeded only 5% of the time, can be computed. The actual allocation, Aj, must 
be at least this amount to satisfy the demand at least 95% of the time. 

The demand d_j^0.95, which is at least equal to the actual 
demand 95% of the time, is determined by setting the cumulative distribution to 
0.95. 


0.95 = d/(1 + d); d = 0.95 + 0.95d; thus d = 0.95/0.05 = 19. 
The deterministic equivalent of the chance constraint is Aj ≥ d_j^0.95 = 19. 
(a) Define the deterministic constraints for: 


(i) Pr{Aj ≥ Dj} ≥ 0.80    Solution: Aj ≥ 4 
(ii) Pr{Aj ≤ Dj} ≤ 0.10   Solution: Aj ≥ 9 
(iii) Pr{Aj ≥ Dj} ≤ 0.50  Solution: Aj ≤ 1 

(b) Generate a series of random uniformly distributed probabilities and their cor- 
responding values of demand d. The proportion of d values less than or equal 
to 19 is a way to see if the minimum allowable allocation of 19 will satisfy 
the random demand at least 95% of the time. Now you can also check on your 
answer to (i) and (ii) above as well. 


Solution: Generate random numbers p uniformly distributed from 0 to 1. Compute 
the associated value of d (d = p/(1 - p)). If d ≤ 19 (or p ≤ 0.95), generate a 1; otherwise 
generate a 0. Do this a large number, n, of times. Add up all the 1’s and divide by 
n. This will be the probability of the demand being met if the allocation is 19. 
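
The sampling procedure just described can be sketched in a few lines. The sample size and the fixed seed below are arbitrary assumptions made so that the run is repeatable.

```python
import random

def demand_from_probability(p):
    """Invert the cumulative distribution p = d/(1 + d)."""
    return p / (1.0 - p)

def fraction_met(allocation, n=100_000, seed=1):
    """Share of sampled demands that do not exceed the allocation."""
    rng = random.Random(seed)          # fixed seed: an arbitrary, repeatable choice
    met = 0
    for _ in range(n):
        d = demand_from_probability(rng.random())   # rng.random() is in [0, 1)
        if d <= allocation:
            met += 1
    return met / n

for allocation in (19, 4, 9, 1):
    print(allocation, fraction_met(allocation))
```

The printed fractions should be close to 0.95, 0.80, 0.90 and 0.50, which is one way to check the deterministic equivalents in part (a).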


2. Consider an allocation problem where the supply of resources available for 
various users in each time period is uncertain. Assume the supply’s probability 
distribution in each time period is uniform between 5 and 15. Users want to 
know the tradeoff between what allocation they can count on and its reliability. 
If your objective when allocating the available resources is to minimize the 
maximum percentage deficit between what each user wants and what they get, 
or equivalently to maximize their minimum level of satisfaction, show the model you would 
use to generate the information they desire. 


Solution: 
Let x(i) be each user’s allocation, 
T(i) be their desired target allocation, 
S be the total supply, which is random, and 
R be the desired reliability. 
Maximize Satisfaction 
subject to: 
Satisfaction ≤ x(i)/T(i) for all users i. 
Pr{Σ_i x(i) ≤ S} ≥ R, whose deterministic equivalent is 

Σ_i x(i) ≤ 5 if R = 1, 
Σ_i x(i) = 10(1.5 - R) otherwise. 

Recognizing that each x(i)/T(i) will be equal, let X be the sum of all x(i) and T 
be the sum of all T(i). X/T will equal each user’s level of satisfaction. The Excel 
display below shows the tradeoffs between R and X/T assuming T is 10. There is no 
need to optimize. 

X     R     X/T 
1     1     0.1 
2     1     0.2 
3     1     0.3 
4     1     0.4 
5     1     0.5 
6     0.9   0.6 
7     0.8   0.7 
8     0.7   0.8 
9     0.6   0.9 
10    0.5   1 
11    0.4   1 
12    0.3   1 
13    0.2   1 
14    0.1   1 
15    0     1 

[Figure: Tradeoffs with Target = 10, plotting R and X/T against X.]
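
The tradeoff table follows directly from the uniform supply distribution. A brief sketch, assuming supply uniform between 5 and 15 and a total target of 10 as in the display; the function names are illustrative.

```python
def reliability(total_allocation, low=5.0, high=15.0):
    """Pr{supply >= total allocation} for supply uniform on [low, high]."""
    if total_allocation <= low:
        return 1.0
    if total_allocation >= high:
        return 0.0
    return (high - total_allocation) / (high - low)

def satisfaction(total_allocation, total_target=10.0):
    """Common satisfaction level X/T, capped at 1."""
    return min(1.0, total_allocation / total_target)

table = [(X, reliability(X), satisfaction(X)) for X in range(1, 16)]
for X, R, s in table:
    print(f"{X:2d}  R={R:.1f}  X/T={s:.1f}")
```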


3. Monte Carlo sampling. 
(a) Show how you would generate equally likely values of the random variable 
X that have the following probability distribution: 


[Figure: probability density function f_X(x).]


Solution: 

Generate values of p(t) that are uniformly distributed from 0 to 1. 
Assume f_X(x) = x/200 for x from 0 to 20, and let p be the value of its cumulative distribution. 

Let each p = ∫_0^x (u/200) du = x²/400. Hence x(t) = 20 p(t)^0.5. 

Note: If you assume a uniform (rectangular) distribution, again from 0 to 20, then 
f_X(x) = 0.05 and p = F_X(x) = 0.05x. Thus x(t) = 20p(t). 


(b) Show how to compute the mean or expected or average value, and the variance, 
of n discrete x(t) values randomly generated from this probability distribution. 
Compare these values with the true values of the mean and variance. 


Solution: 

Mean = Σ_{t=1}^{n} x(t)/n 

Variance = Σ_{t=1}^{n} (x(t) - mean)²/n 

True mean is ∫_0^20 (x²/200) dx = 13.33 

True variance is ∫_0^20 (x - 13.33)² (x/200) dx = 22.22 

4. Consider a random variable X that has the following discrete probability 
distribution, ranging from 2 to 5. 


(a) Describe how to generate multiple discrete values, x(i), of the random variable 
X that fit this distribution. 


Generate random values p(i) uniformly distributed between 0 and 1 (say, using 
=RAND() in Excel). Then follow the rules below. 

If p(i) ≤ 0.2, x(i) = 2; if 0.2 < p(i) ≤ 0.5, x(i) = 3; if 0.5 < p(i) ≤ 0.8, x(i) = 4; if 
p(i) > 0.8, x(i) = 5. 


(b) Write the equations for calculating the mean and variance of all the n values 
you obtained. 


Mean = (1/n) Σ_{i=1}^{n} x(i)      Variance = (1/n) Σ_{i=1}^{n} (x(i) - mean)² 


5. You are having to decide how many trucks you need to purchase and drivers 
you need to hire to pick up trash each day. Between 10 and 30 truck-day units 
of trash are produced each day, and these amounts are uniformly distributed. All 
the trash must be picked up each day. Each truck can haul enough to bring in 
$ 600 per day. However, for each day a truck and driver are idle because there 
is not enough trash to pick up, the loss is $ 800 per truck. If private contractors 
must be hired to pick up any excess trash, the cost is $ 200 per truck per day. 




Example: If 20 trucks are available (the target) and only 18 are needed the net 
income is 20(600)—2(800). If 22 trucks are required, the net income is 20(600)— 
2(200). 


(a) Describe how to determine the most economical target number of trucks to 
buy using Monte Carlo sampling. 


Generate a set of n uniformly distributed probabilities p(i) ranging from 0 to 1. 

Compute each p(i)’s corresponding trash generation x(i) value that is derived 
from the trash probability distribution: x(i) = 10 + 20 p(i). 

Select a target value T and then calculate the net income, NI(i), associated with 
each x(i) using 600 T - max(800(T - x(i)), 200(x(i) - T)). 

Calculate the mean net income: NI = (1/n) Σ_{i=1}^{n} NI(i). 


Select another target T and repeat. Find the target T that maximizes the mean NI. 
(In this case the best T is 26.) 

(b) Develop and solve an optimization model for finding the number of trucks to 
buy that maximizes expected net income. 


Maximize 600 T - 800 ∫_10^T ((T - x)/20) dx - 200 ∫_T^30 ((x - T)/20) dx. This will result 
in T = 26 and an expected net income of $10,400. 

Cumulative distribution value = (6 + 2)/(8 + 2) = 0.80, hence T = 26 is 80% 
reliable. 


(c) If you wanted to be sure that your target number of trucks would be able to 
pick up all the trash produced at least 90% of the time, what would be the 
target number? 


Pr(T ≥ X) ≥ 0.90 is equivalent to T ≥ 28 since x exceeds 28 only 10% of the time. 
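
Both the Monte Carlo search and the expected-value model for the truck problem can be sketched together. The closed form below is the result of carrying out the two integrals for demand uniform on 10 to 30; the sample size, seed, and function names are illustrative assumptions.

```python
import random

def net_income(T, x):
    """Net income for a target of T trucks when x truck-days of trash are produced."""
    return 600 * T - max(800 * (T - x), 200 * (x - T))

def mean_net_income(T, n=20_000, seed=3):
    """Monte Carlo estimate using x = 10 + 20p, with p uniform on [0, 1)."""
    rng = random.Random(seed)     # same seed for every T, so targets share samples
    return sum(net_income(T, 10 + 20 * rng.random()) for _ in range(n)) / n

def expected_net_income(T):
    """600T - 800*int_10^T (T-x)/20 dx - 200*int_T^30 (x-T)/20 dx, integrated out."""
    return 600 * T - 20 * (T - 10) ** 2 - 5 * (30 - T) ** 2

best_exact = max(range(10, 31), key=expected_net_income)
best_mc = max(range(10, 31), key=mean_net_income)
print(best_exact, expected_net_income(best_exact), best_mc)
```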
15. Simulation Modeling 


1 Bus replacement 


Every year 5% of the passenger buses in a town need to be replaced due to obsolescence 
and no longer meeting safety and environmental standards. Current plans 
and budget constraints call for the purchase of 10 new buses each year. How many 
buses must the bus company have if these rates of change can be sustained? Is 
this equilibrium stable? 


B_{t+1} = B_t (1 - 0.05) + 10, so at equilibrium B_{t+1} = B_t = B. Hence B(0.05) = 10 and 
thus B = 200. 

Check: If B = 200 now, next year it will be 200(.95) + 10 = 200. 

If B is 100 now, next year it will be 100(.95) + 10 = 105. It is increasing. 
If B is 300, next year it will be 300(.95) + 10 = 295. It is decreasing. Thus, the 
equilibrium is stable. 
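
The stability check can be extended over many years with a short simulation. A minimal sketch of the fleet recursion; the 200-year horizon and function name are arbitrary choices.

```python
def simulate_fleet(initial, years, retire_rate=0.05, purchases=10):
    """Apply B(t+1) = B(t)*(1 - retire_rate) + purchases for the given number of years."""
    fleet = [float(initial)]
    for _ in range(years):
        fleet.append(fleet[-1] * (1 - retire_rate) + purchases)
    return fleet

from_below = simulate_fleet(100, 200)
from_above = simulate_fleet(300, 200)
print(from_below[1], from_below[-1], from_above[1], from_above[-1])
```

Starting from either 100 or 300 buses, the fleet size drifts toward 200, which is what a stable equilibrium implies.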


2. Controlling algal blooms 


In many lakes algal blooms are an increasing hazard. They are often caused by 
excessive phosphorus, P, entering the lake. 

Consider a small lake having a constant volume V cubic meters. Thus its inflow 
Q equals its outflow Q. Currently the mass of phosphorus entering the lake is P 
kg per day. The daily amount of phosphorus decay per unit phosphorus mass in 
the lake is the decay constant k. Each of these values, V, Q, P, and k, are known. 

The daily change, dM/dt, of phosphorus mass, M, in the lake depends on the 
daily mass of phosphorus entering the lake, P, the mass of phosphorus that exits 
the lake in the outflow, QM/V, and the mass of phosphorus that decays in the lake, 
kM. This change in lake phosphorus mass can be written: 


dM/dt = P — QM/V — kM 


(a) Suppose the initial lake nutrient mass at the beginning of day 1, M(1), is 0. 
Given a constant mass of phosphorus, P, entering the lake each day beginning 
in day 1, show how you could determine the mass of phosphorus, M(t), at the 
beginning of each following day t. 


M(t + 1) = M(t) + [P - QM(t)/V - kM(t)]Δt where Δt = 1. 
Solve this equation for successive days t starting when t = 1 and M(1) = 0. 


(b) Will the phosphorus mass in the lake reach an equilibrium, and if so what is 
it? (express as a function of V,Q, P, and k.) 


When dM /dt = 0, equilibrium mass M = P/((Q/V) +k) 

(c) Suppose the phosphorus entering the lake, P, can be reduced by a fraction X (X percent divided by 100). This 
would cost C(X). How could you define the tradeoff between this cost, C(X), and 
the equilibrium phosphorus concentration, M/V, in the lake? 

Pick various values of X and solve for the corresponding equilibrium concentrations, 
M/V, and costs, C(X). 


Equilibrium concentration = M/V = P(1 - X)/(Q + kV), with X expressed as a fraction. 
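
The day-by-day mass balance and its equilibrium can be compared in a short simulation. The parameter values below are purely illustrative assumptions (the exercise gives none); with a one-day step the update is only well behaved when Q/V + k is well below 1.

```python
def simulate_lake(P, Q, V, k, days, M0=0.0):
    """Daily update M(t+1) = M(t) + [P - Q*M(t)/V - k*M(t)] with a one-day step."""
    M = [M0]
    for _ in range(days):
        M.append(M[-1] + (P - Q * M[-1] / V - k * M[-1]))
    return M

def equilibrium_mass(P, Q, V, k):
    """Mass at which dM/dt = 0."""
    return P / (Q / V + k)

# Illustrative values only (assumed, not from the exercise).
P, Q, V, k = 50.0, 1_000.0, 10_000.0, 0.05
M = simulate_lake(P, Q, V, k, days=200)
M_eq = equilibrium_mass(P, Q, V, k)
print(M[-1], M_eq, M_eq / V)
```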
3. Forest sustained yield 


One measure of the amount of forest growth in the watershed is the basal area 
of trees. This is the cross-sectional area of the trunk near the base of the tree. 
For both hardwood and softwood species the increase in basal area per hectare 
per year is directly proportional to the initial basal area of that species. However, 
this potential increase in basal area is reduced by the loss in basal area due to 
competition from its own species and from the other species. 
Let 

H(y) = Basal area of hardwoods per hectare at the beginning of year y. 

S(y) = Basal area of softwoods per hectare at the beginning of year y. 

r_t = Basal area growth per unit basal area per hectare for species type t (t = h for hardwoods, s for softwoods). 


a_t = Basal area loss per unit of basal area of species type t per unit basal area of 
same species per hectare. 


b_t = Basal area loss per unit of basal area of species type t per unit basal area of 
different species per hectare. 


Equations that describe the changes in basal area over time for both tree species 
can be written: 


dH/dy = r_h H(y) - a_h H(y)² - b_h H(y)S(y) 


dS/dy = r_s S(y) - a_s S(y)² - b_s H(y)S(y) 
Assume r_h = 0.3; r_s = 0.5; a_h = 0.1; a_s = 0.1; b_h = 0.05; b_s = 0.05. 


If this forest is to be managed in a sustainable way to obtain a constant harvest of 
hardwood and softwood in each year, create a model to determine how much of 
each type of species can be harvested each year depending on the relative value 
per unit basal area of hardwoods compared to that of softwoods. 

Model: 

Let CH be the harvest of hardwoods, and CS be the harvest of softwoods. 

Maximize CH + v*CS. 

H + CH = (1 + rh) *H — ah*H*H — bh*S*H. 

S+ CS = (1 + rs)*S — as*S*S — bs*S*H. 

rh = 0.3; rs = 0.5; ah =0.1; as =0.1; bh =0.05; bs = 0.05. 
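
The model is small enough to explore with a plain grid search instead of a nonlinear solver. This is a sketch, not the method used to produce the listed results: the grid step and the upper bound of 3.0 on both stocks are assumptions chosen to keep the search short.

```python
RH, RS, AH, AS, BH, BS = 0.3, 0.5, 0.1, 0.1, 0.05, 0.05

def harvests(H, S):
    """Annual harvests CH and CS that hold the stocks H and S constant."""
    CH = RH * H - AH * H * H - BH * S * H
    CS = RS * S - AS * S * S - BS * S * H
    return CH, CS

def best_stock(v, step=0.01, upper=3.0):
    """Grid search for stocks (H, S) that maximize CH + v*CS with CH, CS >= 0."""
    best = (float("-inf"), 0.0, 0.0)
    n = int(round(upper / step))
    for i in range(n + 1):
        for j in range(n + 1):
            H, S = i * step, j * step
            CH, CS = harvests(H, S)
            if CH >= 0 and CS >= 0:
                best = max(best, (CH + v * CS, H, S))
    return best          # (objective value, H, S)

for v in (0.0, 1.0):
    print(v, best_stock(v))
```

The values it reports for v = 0 and v = 1 can be compared with the solution listed for this exercise.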


Solution: 

        If v = 0      If v = 0.5    If v = 1      If v = 99 
Obj     0.225         0.3565        0.633333 
CH      0.2250000     0.0986767     0.0500000     0.0000000 
CS                    0.5156900     0.5833333     0.6250000 
H       1.500000      0.7826084     0.3333334     0.0000000 
S                     1.913044      2.333333      2.500000 

16. Multi-Criteria Analyses 


1 Weighting and constraining multiple objectives 


(a) Express the following model in a form used for defining the efficiency frontier 
(tradeoff between the two objectives) using the weighting method and 
the constraint method. 


Maximize z1 = 4x1 - x2 
Maximize z2 = -2x1 + 6x2 
Subject to x1 ≤ 4 
x1 + x2 ≤ 6 
x1, x2 ≥ 0 
Weighting method: Maximize w1 z1 + w2 z2 or (w1/16) z1 + (w2/36) z2, where 16 and 36 are the maximum values of z1 and z2. 
Constraint method: Maximize z1 subject to z2 ≥ L. 
Both are subject to the same constraints on x and definitions of z1 and z2 in terms of x. 


Select different values of weights or L. Note: w1 + w2 = 1. 


(b) Plot the efficiency frontiers in decision and objective spaces. 

[Figures: the decision space (x1, x2) and the objective space (z1, z2), each with the efficiency frontier marked.]


2 Resource allocation 


Consider again the resource allocation problem where three users obtain benefits 
B(X) from the resources X they get allocated to them. The functions B(X) and 
their maximum values are shown below. 


B1(X1) = 6X1 - X1²     →  X1^max = 3 and B1^max = B1(X1^max) = 9 
B2(X2) = 7X2 - 1.5X2²  →  X2^max = 7/3 and B2^max = B2(X2^max) = 147/18 
B3(X3) = 8X3 - 0.5X3²  →  X3^max = 8 and B3^max = B3(X3^max) = 32 


Instead of finding the values of each allocation that maximizes the total bene- 
fits, assuming only 6 resources are available, each user wants to maximize their 
own benefits. This is now a multi-objective problem. Show how to find the 
tradeoffs among each user using the weighting, constraint, goal attainment and 
goal-programming methods. 


Weighting method: 

Objective: max {w1 B1(X1) + w2 B2(X2) + w3 B3(X3)} 
Subject to: X1 + X2 + X3 ≤ 6 


Constraint method: 
Objective: max {B3(X3)} 
Subject to: X1 + X2 + X3 ≤ 6 
B1(X1) ≥ α 
B2(X2) ≥ β 


Goal Attainment method: 
Objective: min {D} 
Subject to: X1 + X2 + X3 ≤ 6 
w1 [9 - B1(X1)]/9 ≤ D 
w2 [147/18 - B2(X2)]/(147/18) ≤ D 
w3 [32 - B3(X3)]/32 ≤ D 


Goal-Programming method: 
Objective: min {L1(D1) + L2(D2) + L3(D3)} 
Subject to: X1 + X2 + X3 ≤ 6 
9 - B1(X1) ≤ D1 
147/18 - B2(X2) ≤ D2 
32 - B3(X3) ≤ D3 

3. More multi-objective modeling 
Consider the following multiple objective optimization problem: 


Maximize Z1. 
Maximize Z2. 


Z1 = 2X. 
Z2 = 3Y. 
X² + Y² ≤ 16. 


Show how you could use the weighting and constraint methods to identify the 
tradeoff among various maximum values of Z1 and Z2. 

Weighting method: Maximize wl Z1 + w2 Z2 and vary the weights to define 
points on the tradeoff frontier. 

Constraint method: Maximize Z1 subject to Z2 ≥ L and vary the lower bound 
L between 0 and the maximum Z2 (= 3*4 = 12) to define points on the tradeoff frontier. 
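
Because the feasible region is a disc of radius 4, both methods have closed-form solutions that can be used to trace the frontier. The sketch below assumes X and Y are non-negative and that the weights are non-negative and not both zero; the function names are illustrative.

```python
import math

def constraint_method(L):
    """Maximize Z1 = 2X subject to Z2 = 3Y >= L and X^2 + Y^2 <= 16 (X, Y >= 0)."""
    Y = L / 3.0                       # Z1 is largest when Z2 sits exactly at its bound
    X = math.sqrt(max(16.0 - Y * Y, 0.0))
    return 2.0 * X, 3.0 * Y

def weighting_method(w1, w2):
    """Maximize w1*Z1 + w2*Z2 on the disc: the optimum lies along the direction (2*w1, 3*w2)."""
    gx, gy = 2.0 * w1, 3.0 * w2
    norm = math.hypot(gx, gy)
    X, Y = 4.0 * gx / norm, 4.0 * gy / norm
    return 2.0 * X, 3.0 * Y

frontier = [constraint_method(L) for L in range(0, 13)]
for z1, z2 in frontier:
    print(round(z1, 3), round(z2, 3))
print(weighting_method(0.47, 0.53))
```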


[Figure: Z1 vs Z2 tradeoff frontier, with example outputs of models using the weighting method (w2/w1 = 0.53/0.47) and the constraint method (L = 10.3). One labeled value in the figure is Y = 3.333333.]


Goal Attainment method: Minimize D subject to w1(T1 - Z1) ≤ D; w2(T2 - 
Z2) ≤ D; for selected objective target values T and varying weights w. For example, 
assume T1 and T2 are both 10. T1 is more than can be obtained and T2 is less 
than can be obtained. For varying combinations of weights, the solutions are: 


Variable   Value       Value       Value       Value 
Z1         6.656402    2.237806    8.000000    5.402349 
X          3.328201    1.118903    4.000000    2.701175 
Z2         6.656402    10.69097    0.000000    8.850587 
Y          2.218801    3.563656    0.0         2.950196 
D          1.671799    0.0         2.0         0.9195301 
W1         0.5         0.0         1.0         0.2 
W2         0.5         1.0         0.0         0.8 

[Figure: Z1 vs. Z2, showing the region of feasible values.]


17. Fuzzy Optimization 


1. Consider the problem of heating a swimming pool. You are told to maintain the 
right temperature, T, and not spend too much money, C(T), doing it. How might 
you develop a fuzzy model for determining the ‘best’ temperature and cost? 
Assume you know the cost function C(T). Draw and quantify the membership 
functions and develop the optimization model that maximizes the minimum 
membership value. 


Possible solution. 


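[Figure: membership functions M_T (values 0 to 1) over temperature T, with breakpoints a, b, c, d and segments t1 to t4, and M_C (values 0 to 1) over cost C(T), with breakpoints e, f and segments c1 to c3.]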


To simplify, assume the solution is within the concave part of each membership 
function. (Otherwise, binary variables must be used and constraints for T and C(T) need 
changing.) 

Let st_i and sc_i be the slopes of the linear functions in segment i. 

Maximize D 

M_T ≥ D;  M_T = st1 t1 + st2 t2 + st3 t3 + st4 t4 
t1 ≤ a, t2 ≤ b - a, t3 ≤ c - b, 
t1 + t2 + t3 + t4 = T 
M_C ≥ D;  M_C = 1 + sc1 c1 + sc2 c2 + sc3 c3 
c1 ≤ e, c2 ≤ f - e, 
c1 + c2 + c3 = Cost(T) 


2. Water quality management 


Exercise 7 in Chap. 7 involved finding the ‘least-cost’ amounts of wastewater 
treatment (treatment efficiencies) at sites 1 and 2 that meet stream quality standards 
at sites 2 and 3: Currently there is no treatment. All the wastewater is discharged 
into the stream. 


[Figure: stream with wastewater discharges of 200 kg/day at site 1 and 100 kg/day at site 2.]

                                      Site 2     Site 3 
Current Pollutant Concentrations:     58 mg/l    95 mg/l 
Maximum Allowable Concentrations:     18 mg/l    23 mg/l 


Available Data: 

Stream flow = 1000 m³/day at all sites. 1 kg/day / 1000 m³/day = 1 mg/l. 

Fraction of waste discharged into stream at site 1 that reaches site 2: 0.25. 

Fraction of waste discharged at site 1 that reaches site 3: 0.15. 

Fraction of waste at and discharged into stream at site 2 that reaches site 3: 
0.60. 

Limits of treatment: removal of 30% required, but no more than 90%, for both 
sites. The initial concentration just upstream of site 1 is 32 mg/l. 

Assume the costs of waste removal are 30*fraction removed at site 1 and 
20* fraction removed at site 2. 

Can you find a solution that “keeps the stream clean yet doesn’t cost too much”? 


Model: 
Cost = 30*x1 + 20*x2. 
Quality at site 2: 
(32 + 200*(1 - x1))*0.25 ≤ P; 
Quality at site 3: 
(32 + 200*(1 - x1))*0.15 + 100*(1 - x2)*0.60 ≤ P. 
Treatment restrictions: 
x1 ≤ 0.9; x2 ≤ 0.9; 
x1 ≥ 0.3; x2 ≥ 0.3 


Membership functions: a = 30, b = 50, d = 15, e = 25. 

Mc = 1 - cc2/(b - a). 

Cost = cc1 + cc2; cc1 ≤ a. 

Mp = 1 - p2/(e - d); p1 ≤ d. 

P = p1 + p2 

[Figure: membership functions Mc (1 at cost a, falling to 0 at cost b) and Mp (1 at concentration d, falling to 0 at concentration e).]

Maximize M; M ≤ Mc; M ≤ Mp. 


Model solution : 


Variable Value 

cost 40.09756 
P 20.04878 
x1 0.7590244 
x2 0.8663415 
M 0.4951220 
Mp 0.4951220 
Mc 0.4951220 
cc2 10.09756 
cc1 30.00000 
p1 15.00000 
p2 5.048780 
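
The max-min solution can be approximated without an LP solver by searching a grid of treatment fractions. This is a rough sketch, assuming the membership functions above (with memberships held between 0 and 1); the grid step of 0.002 is an arbitrary choice, so the result only approximates the reported values.

```python
def memberships(x1, x2, a=30.0, b=50.0, d=15.0, e=25.0):
    """Cost and quality memberships for treatment fractions x1 and x2."""
    cost = 30 * x1 + 20 * x2
    site2 = (32 + 200 * (1 - x1)) * 0.25
    site3 = (32 + 200 * (1 - x1)) * 0.15 + 100 * (1 - x2) * 0.60
    P = max(site2, site3)                        # worst concentration (mg/l)
    Mc = 1 - min(max(cost - a, 0.0), b - a) / (b - a)
    Mp = 1 - min(max(P - d, 0.0), e - d) / (e - d)
    return Mc, Mp, cost, P

def best_treatment(step=0.002):
    """Grid search over 0.3 <= x1, x2 <= 0.9 for the largest min(Mc, Mp)."""
    n = int(round(0.6 / step))
    best = (-1.0, 0.0, 0.0)
    for i in range(n + 1):
        for j in range(n + 1):
            x1, x2 = 0.3 + i * step, 0.3 + j * step
            Mc, Mp, _, _ = memberships(x1, x2)
            best = max(best, (min(Mc, Mp), x1, x2))
    return best          # (M, x1, x2)

M, x1, x2 = best_treatment()
print(round(M, 4), round(x1, 3), round(x2, 3))
```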


Miscellaneous 


1. How many places on the earth’s surface can a person travel 1 km south, then 
1 km east or west, and finally 1 km north and end up at exactly where they 
started? 

Infinite. The north pole, and anywhere on a parallel of latitude 1 km north of another 
parallel of latitude exactly 1/n km in circumference, where n is any integer > 0. These 
parallels of latitude would be just north of the south pole. 


2. The diagram below shows a rectangular room of dimensions 12 x 12 x 30 feet. 
On the inside surface of the end walls are two bugs. One is 1 foot up from 
the base and the other is 1 foot down from the top, and both are 6 feet from 
either of the side walls. They would like to meet each other. What is the shortest 
distance they can travel on the inside surface of the room to meet? (They cannot 
fly.) The answer is less than the straight path of 11 + 30+ 1 = 42. 


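The shortest path is 40 feet: unfolding the room's surfaces so that the path becomes a 
straight line gives a 3-4-5 right triangle with sides 3(8) = 24, 4(8) = 32, and 5(8) = 40. 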


3. A horn is created by rotating the function 1/x about the x axis from x = 1 to x 
= ∞. How much paint would you need to paint the inside surface of the horn? 
Hint: To find the surface area integrate the circumference 2πr, where r = 1/x, 
from x = 1 to ∞. You will find the surface area to be infinite. Yet the amount 
of paint you need is finite. 


Surface: ∫_1^∞ (2π/x) dx = 2π[ln(∞) - ln(1)] = ∞ - 0 = ∞ 


Volume: ∫_1^∞ (π/x²) dx = π[-1/∞ - (-1/1)] = π units of 
paint 

[Figure: the horn formed by rotating 1/x about the x axis.]

Thus fill the glass with π units of paint and throw 
out that which doesn’t stick to the (infinite) side 
surface. 


4. Types of Averages: 
How do you compute the average value of different discrete data? 

If someone wants to find an average value of a data set, most will think of 
computing the arithmetic average, also called the mean value. 

Assume a data set {x(i)}, i = 1, 2, ..., n. 

The arithmetic mean, AM, is the sum of all n x(i) values divided by n. This 
assumes each x(i) is equally likely. 

More generally, AM is the sum from 1 to n of the products x(i)*p(i) where the 
sum of all p(i) = 1. 


AM = Σ_{i=1}^{n} x(i)/n  or  Σ_{i=1}^{n} x(i) p(i) 

In some cases, the geometric mean is a more accurate estimate of the average 
or mean value. The geometric mean, GM, is the nth root of the product of all n 
values of x(i). 

GM = [Π_{i=1}^{n} x(i)]^(1/n) 

Another average is the root mean squared, RMS. This is the square root of the 
sum of n values of x(i) squared divided by n. 

RMS = [Σ_{i=1}^{n} x(i)²/n]^0.5 

Finally, there is the harmonic mean, HM. This HM is n divided by the sum of 1/x(i), 
or 1/[Σ_{i=1}^{n} (1/x(i))/n]. 

HM = n/[Σ_{i=1}^{n} 1/x(i)]  or  1/[Σ_{i=1}^{n} (1/x(i))/n] 

The values of these four different means have the following relationship. 


RMS ≥ AM ≥ GM ≥ HM. 


Example: 

n = 6 

Data:   Variable   Value 
        X(1)       4 
        X(2)       3 
        X(3)       8 
        X(4)       5 
        X(5)       9 
        X(6)       1 

(a) Calculate each of the four averages of these data. 


AM 5.000000 
GM 4.035654 
RMS 5.715476 
HM 2.971114 
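
The four averages for these data can be recomputed in a few lines. A minimal sketch using only the standard library; the function names are illustrative, and the geometric mean is computed through logarithms, which assumes all data are positive.

```python
import math

def arithmetic_mean(xs):
    return sum(xs) / len(xs)

def geometric_mean(xs):
    """nth root of the product, computed through logarithms (positive data only)."""
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

def root_mean_square(xs):
    return math.sqrt(sum(x * x for x in xs) / len(xs))

def harmonic_mean(xs):
    return len(xs) / sum(1.0 / x for x in xs)

data = [4, 3, 8, 5, 9, 1]
AM = arithmetic_mean(data)
GM = geometric_mean(data)
RMS = root_mean_square(data)
HM = harmonic_mean(data)
print(f"AM={AM:.6f}  GM={GM:.6f}  RMS={RMS:.6f}  HM={HM:.6f}")
```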


(b) Consider computing your average income tax rate over a number of years. 


AM = (1/n) Σ_y (total tax(y)/total income(y)) gives lower weights to high tax rates. 
For example: AM = 


((14,000/100,000) + (20,000/300,000))/2 = (14% + 6.67%)/2 = 0.103 = 10.3% 

alternatively: 


Σ_y (total tax(y)) / Σ_y (total income(y)) = 34,000/400,000 = 0.085 = 8.5% 


Which method is correct? 


(c) Finding the average annual rate of return i given various annual interest rates 
r(y). 
(1 + i)^n = (1 + r(1))(1 + r(2))....(1 + r(n)) where i is the GM interest rate. 
Example: GM: (1 + i)² = (1 + 0.1)(1 + 0). i = 1.1^0.5 - 1 = 0.0488, not 
0.05. 
(d) Finding the average speed of a vehicle over a given distance. 


If the speed over the first 100 km is 40 km/hour, and the speed over the next 
100 km is 60 km/hour, the average speed is not the arithmetic mean, AM = 
50 km/hour. If that is not obvious, consider a speed of 0 for the first 100 km, 
and a speed of 60 for the final 100 km. The average AM is 30 when in fact the 
vehicle would never reach the second half of the journey. 

The average speed is the harmonic mean, HM = total distance/total time = 
200/(100/40 + 100/60) = 48 km/hour. 


5. What physical part of a train or trolley goes backward when the train or trolley 
goes forward? 

Any part of the flange of the wheel that extends below the surface of the track, as 
long as it remains below that surface. 


6. A jogger arrives at a railroad station an hour earlier than when her chauffeur 
usually picks her up to go to her home. Not being able to call her chauffeur she 
starts jogging at 6 miles per hour. She meets the chauffeur going the other way. 
He picks her up and drives her to her home. They arrive at her home 20 min 
before they usually get there. How fast does the chauffeur drive? 


The 20 min saved is the time the chauffeur drives from where she was picked 
up to the station and back, or 10 min each way. Had she kept jogging for 10 more 
minutes the car would have reached the station one hour after she started jogging. 
Hence, she jogged 50 min before being picked up. At 6 mph she jogged 6 (50/60) = 
5 miles. The chauffeur drives those 5 miles in 10 min or at 30 mph. 